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Description 

BACKGROUND OF THE INVENTION 
s 1. Field of the Invention 

[00011 The present invention relates to novel polynucleotides derived from microorganisms belonging to coryneform 
bacteria and fragments thereof, polypeptides encoded by the polynucleotides and fragments thereof, polynucleotide 
arrays comprising the polynucleotides and fragments thereof, computer readable recording media in which the nucle- 
io otide sequences of the polynucleotide and fragments thereof have been recorded, and use of them as well as a method 
of using the polynucleotide and/or polypeptide sequence information to make comparisons. 

2. Brief Description of the Background Art 

is [00021 Coryneform bacteria are used in producing various useful substances, such as amino acids, nucleic acids, 
vitamins saccharides (for example, ribulose), organic acids (for example, pyruvic acid), and analogues of the above- 
described substances (for example, N-acetylamino acids) and are very useful microorganisms industrially. Many mu- 

SoospBar "xampte^Corynebacterium glutamicum is a Gram-positive bacterium identified as a glutamic acid-pro- 
20 ducing bacterium, and many amino acids are produced by mutants thereof. For example, 1 ,000.000 ton/year of L- 
glutamic acid which is useful as a seasoning for umami (delicious taste), 250.000 ton/year of L-iysine which Is a valuable 
additive for livestock feeds and the like, and several hundred ton/year or more of other amino acids, such as L-arg nine, 
L-proline, L-glutamine, L-tryptophan, and the like, have been produced in the world (Nikkei Bio Yearbook 99, published 
by NiKKsi BP (1998)) 

25 [0004] The production of amino acids by Corynebacterium glutamicum is mainly carried out by its mutants (metabolic 
mutants) which have a mutated metabolic pathway and regulatory systems. In general, an organism is provided with 
various metabolic regulatory systems so as not to produce more amino acids than it needs. In the biosynthesis of L- 
lysine for example, a microorganism belonging to the genus Corynebacterium is under such regulation as preventing 
the excessive production by concerted inhibition by lysine and threonine against the activity of a biosynthesis enzyme 
so common to lysine, threonine and methionine, i.e., an aspartokinase, (J. Biochem., 65: 849-8S9 (1 969)). The biosyn- 
thesis of arginine is controlled by repressing the expression of its biosynthesis gene by arginine so as not to b.osyn- 
thesize an excessive amount of arginine (Microbiology, 142: 99-108 (1996)). It is considered that these metabolic 
regulatory mechanisms are deregulated in amino acid-producing mutants. Similarly, the metabolic regulation is dereg- 
ulated in mutants producing nucleic acids, vitamins, saccharides, organic acids and analogues of the above-descnbed 
as substances so as to improve the productivity of the objective product. 

[00051 However accumulation of basic genetic, biochemical and molecular biological data on coryneform bacteria 
is insufficient in comparison with Escherichia coll, Bacillus subtilis, and the like. Also, few findings have been obtained 
on mutated genes in amino acid-producing mutants. Thus, there are various mechanisms, which are still unknown, of 
regulating the growth and metabolism of these microorganisms. 
40 [00061 A chromosomal physical map of Corynebacterium glutamicum ATCC 1 3032 is reported and it is known that 
its genome size is about 3,1 00 kb (Mol. Gen. Genet, 252:. 255-265 (1 996)). Calculating on the basis of the usual gene 
density of bacteria, it is presumed that about 3,000 genes are present in this genome of about 3.100 kb. However, only 
about 100 genes mainly concerning amino acid biosynthesis genes are known In Corynebacterium glutamicum, and 
the nucleotide sequences of most genes have not been clarified hitherto. 
45 [00071 In recent years, the full nucleotide sequence of the genomes of several microorganisms, such as Escherichia 
coil Mycobacterium tuberculosis, yeast, and the like, have been determined (Science, 277: 1453-62 (1997); Nature, 
393: 537-544 (1 998)- Nature, 387. 5-1 05 (1 997)). Based on the thus determined full nucleotide sequences, assumption 
of gene regions and prediction of their function by comparison with the nucleotide sequences of known genes have 
been carried out. Thus, the functions of a great number of genes have been presumed, without genetic, biochemical 
so or molecular biological experiments. „_ 
[0008] In recent years, moreover, techniques for monitoring expression levels of a great number of genes simulta- 
neously or detecting mutations, using DNA chips. DNA arrays or the like in which a partial nucleic acid fragment of a 
gene or a partial nucleic acid fragment in genomic DNA other than a gene is fixed to a solid support, have been 
developed The techniques contribute to the analysis of microorganisms, such as yeasts, Mycobactenum tuberculosis, 
55 Mycobacterium bovis used in BCG vaccines, and the like (Science, 278: 680-686 (1 997); Proc. Natl. Acad. Sci. USA, 
96. 12833-38 (1999); Science, 284: 1520-23 (1999)). 
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SUMMARY OF THE INVENTION 

[0009] An object of the present Invention is to provide a polynucleotide and a polypeptide derived from a microor- 
ganism of coryneform bacteria which are industrially useful, sequence information of the polynucleotide and the 
5 polypeptide, a method for analyzing the microorganism, an apparatus and a system for use in the analysis, and a 
method for breeding the microorganism. 

[0010] The present invention provides a polynucleotide and an oligonucleotide derived from a microorganism be- 
longing to coryneform bacteria, oligonucleotide arrays to which the polynucleotides and the oligonucleotides are fixed, 
a polypeptide encoded by the polynucleotide, an antibody which recognizes the polypeptide, polypeptide arrays to 
10 which the polypeptides or the antibodies are fixed, a computer readable recording medium in which the nucleotide 
sequences of the polynucleotide and the oligonucleotide and the amino acid sequence of the polypeptide have been 
recorded, and a system based on the computer using the recording medium as well as a method of using the polynu- 
cleotide and/or polypeptide sequence information to make comparisons. 

15 BRIEF DESCRIPTION OF THE DRAWING 

[0011] Fig. 1 is a map showing the positions of typical genes on the genome of Cory nebacte hum glutamicum ATCC 
13032. 

[0012] Fig. 2 Is electrophoresis showing the results of proteome analyses using proteins derived from (A) Coryne- 
20 bacterium glutamicum ATCC 13032, (B) FERM BP-7134, and (C) FERM BP-158. 

[001 3] Fig. 3 is a flow chart of an example of a system using the computer readable media according to the present 
invention. 

[0014] Fig. 4 Is a flow chart of an example of a system using the computer readable media according to the present 
invention. 

25 

DETAILED DESCRIPTION OF THE INVENTION 

[0015] This application is based on Japanese applications No. Hei. 11-377484 filed on December 16, 1999, No. 
2000-159162 filed on April 7, 2000 and No. 2000-280988 filed on August 3, 2000, the entire contents of which are 

30 incorporated hereinto by reference. 

[0016] From the viewpoint that the determination of the full nucleotide sequence of Corynebacterium glutamicum 
would make it possible to specify gene regions which had not been previously identified, to determine the function of 
an unknown gene derived from the microorganism through comparison with nucleotide sequences of known genes 
and amino acid sequences of known genes, and to obtain a useful mutant based on the presumption of the metabolic 

35 regulatory mechanism of a useful product by the microorganism, the inventors conducted intensive studies and, as a 
result, found that the complete genome sequence of Corynebacterium glutamicum can be determined by applying the 
whole genome shotgun method. 

[0017] Specifically, the present invention relates to the following (1 ) to (65): 
40 (1) A method for at least one of the following: 

(A) identifying a mutation point of a gene derived from a mutant of a coryneform bacterium, 

(B) measuring an expression amount of a gene derived from a coryneform bacterium, 

(C) analyzing an expression profile of a gene derived from a coryneform bacterium, 
43 (D) analyzing expression patterns of genes derived from a coryneform bacterium, or 

(E) identifying a gene homologous to a gene derived from a coryneform bacterium, 
said method comprising: 

(a) producing a polynucleotide array by adhering to a solid support at least two polynucleotides selected 
£0 from the group consisting of first polynucleotides comprising the nucleotide sequence represented by any 

one of SEQ ID NOS: 1 to 3501 , second polynucleotides which hybridize with the first polynucleotides under 
stringent conditions, and third polynucleotides comprising a sequence of 10 to 200 continuous bases of 
the first or second polynucleotides, _ _ 

(b) incubating the polynucleotide array with at least one of a labeled polynucleotide derived from a co- 
55 ryneform bacterium, a labeled polynucleotide derived from a mutant of the coryneform bacterium or a 

labeled polynucleotide to be examined, under hybridization conditions, 

(c) detecting any hybridization, and 

(d) analyzing the result of the hybridization. 
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m f rt r ovamnie the at least two polynucleotides can be at least two of the first polynu- 

two of the first, second and third polynucleotides. 
(2) The method according to (1), wherein the coryneform bacterium is a microorganism be.ong.ng to the genus 

(6) A polynucleotide array, comprising: 

tlnuous bases of the first or second polynucleotides, and 
a solid support adhered thereto. 

second and third polynucleotides. rQ „„ cont ort h« «5EO ID N0 1 or a polynucleotide having 

(7) A polynucleotide comprising the nucleotide sequence represented by SEQ ID NO.l or a po.yn 

Nn<t-rae to 6931 or a polynucleotide which hybridizes therewith under stnngent cond.tions 
comprising 10 to 200 continuous based. 

M2^ A recombinant DNA comprising the polynucleotide of any one of (8) to (11). 

l (13) £ t^ the polynucleotide of any one of (8) to (11) or the recombinant DNA of (12). 

(14) A method for producing a polypeptide, comprising: 

culturing the transformant of (13) in a medium to produce and accumulate a polypeptide encoded by the 
polynucleotide of (8) or (9) in the medium, and 
45 recovering the polypeptide from the medium. 

(1 5) A method for producing at least one of an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, 
and analogues thereof, comprising: 

culturing the transformant of (13) in a medium to produce and accumulate at least one of an i amino acid, a 

and analogues thereof from the medium. 

(1 6) A polypeptide encoded by a polynucleotide comprising the nucleotide sequence selected from SEQ ID NOS: 
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added, said polypeptides having an activity which is substantially the same as that of the polypeptide without said 
at least one amino acid deletion, replacement, insertion or addition. 

(19) A polypeptide comprising an amino acid sequence having a homology of at least 60% with the amino acid 
sequence of the polypeptide of (16) or (17), and having an activity which is substantially the same as that of the 
polypeptide. 

(20) An antibody which recognizes the polypeptide of any one of (1 6) to (1 9). 

(21) A polypeptide array, comprising: 

at least one polypeptide or partial fragment polypeptide selected from the polypeptides of (16) to (19) and 
partial fragment polypeptides of the polypeptides, and 
a solid support adhered thereto. 

(22) A polypeptide array, comprising: 

at least one antibody which recognizes a polypeptide or partial fragment polypeptide selectedf rom the polypep- 
tides of (1 6) to (1 9) and partial fragment polypeptides of the polypeptides, and 
a solid support adhered thereto. 

(23) A system based on a computer for identifying a target sequence or a target structure motif derived from a 
coryneform bacterium, comprising the following: 

(i) a user input device that inputs at least one nucleotide sequence Information selected from SEQ ID NOS:1 
to 3501 , and target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input Information; 

(iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS: 
1 to 3501 with the target sequence or target structure motif information, recorded by the data storage device 
for screening and analyzing nucleotide sequence information which is coincident with or analogous to the 
target sequence or target structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

(24) A method based on a computer for identifying a target sequence or a target structure motif derived from a 
coryneform bacterium, comprising the following: 

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501, target se- 
quence information or target structure motif information into a user input device; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one nucleotide sequence Information selected from SEQ ID NOS:1 to 3501 with 
the target sequence or target structure motif information; and 

(iv) screening and analyzing nucleotide sequence information which is coincident with or analogous to the 
target sequence or target structure motif information. 

(25) A system based on a computer for identifying a target sequence or a target structure motif derived from a 
coryneform bacterium, comprising the following: 

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS: 

3502 to 7001 , and target sequence or target structure motif information; 

(Ii) a data storage device for at least temporarily storing the Input information; 

(iii) a comparator that compares the at least one amino acid sequence Information selected from SEQ ID NOS: 
3502 to 7001 with the target sequence or target structure motif information, recorded by the data storage 
device for screening and analyzing amino acid sequence information which is coincident with or analogous to 
the target sequence or target structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

(26) A method based on a computer for identifying a target sequence or a target structure motif derived from a 
coryneform bacterium, comprising the following: 

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 , and target 
sequence information or target structure motif information into a user input device; 
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(ii) at least temporarily storing said information; 

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS.3502 to 7001 
with the target sequence or target structure motif information; and 

(iv) screening and analyzing amino acid sequence information which is coincident with or analogous to the 
target sequence or target structure motif information. 

(27) A system based on a computer for determining a function of a polypeptide encoded by a polynucleotide having 
a target nucleotide sequence derived from a coryneform bacterium, comprising the following: 

(i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID NOS:2 
to 3501, function information of a polypeptide encoded by the nucleotide sequence, and target nucleotide 
sequence information; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS: 
2 to 3501 with the target nucleotide sequence information, and determining a function of a polypeptide encoded 
by a polynucleotide having the target nucleotide sequence which is coincident with or analogous to the poly- 
nucleotide having at least one nucleotide sequence selected from SEQ ID NOS:2 to 3501 ; and 

(iv) an output devices that shows a function obtained by the comparator. 

(28) A method based on a computerfor determining a function of a polypeptide encoded by a polypeptide encoded 
by a polynucleotide having a target nucleotide sequence derived from a coryneform bacterium, comprising the 
following: 

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501, function in- 
formation of a polypeptide encoded by the nucleotide sequence, and target nucleotide sequence information; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501 with 
the target nucleotide sequence information; and 

(iv) determining a function of a polypeptide encoded by a polynucleotide having the target nucleotide sequence 
which is coincident with or analogous to the polynucleotide having at least one nucleotide sequence selected 
from SEQ ID NOS:2 to 3501 . 

(29) A system based on a computerfor determining afunction of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 , function information based on the amino acid sequence, and target amino acid sequence infor- 
mation; 

(ii) a data storing device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 with the target amino acid sequence information for determining a function of a polypeptide 
having the target amino acid sequence which is coincident with or analogous to the polypeptide having at least 
one amino acid sequence selected from SEQ ID NOS:3502 to 7001 ; and 

(iv) an output device that shows a function obtained by the comparator. 

(30) A method based on a computerfor determining a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 , function 
information based on the amino acid sequence, and target amino acid sequence information; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 
with the target amino acid sequence information; and 

(iv) determining a function of a polypeptide having the target amino acid sequence which is coincident with or 
analogous to the polypeptide having at least one amino acid sequence selected from SEQ ID NOS:3502 to 
7001. 

(31) The system according to any one of (23), (25), (27) and (29), wherein a coryneform bacterium is a microor- 
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ganism of the genus Corynebacterium, the genus Brevibacterium, or the genus Mlcrobacterium. 

(32) The method according to any one of (24), (26), (28) and (30), wherein a coryneform bacterium Is a microor- 
ganism of the genus Corynebacterium, the genus Brevibacterium, or the genus Mlcrobacterium. 

(33) The system according to (31 ), wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium giutamicum, Corynebacterium acetoacidophiium, Corynebacterium 
acetoglutamicum, corynebacterium caiiunae, corynebacterium hercuiis, Corynebacterium iiiium, Corynebacterium 
melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

(34) The method according to (32), wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium giutamicum, Corynebacterium acetoacidophiium, Corynebacterium 
acetogiutamicum, Corynebacterium caiiunae, Corynebacterium hercuiis, Corynebacterium iiiium, Corynebacteri- 
um meiassecoia, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

(35) A recording medium or storage device which is readable by a computer in which at least one nucleotide 
sequence information selected from SEQ ID NOS:1 to 3501 or function information based on the nucleotide se- 
quence is recorded, and is usable in the system of (23) or (27) or the method of (24) or (28). 

(36) A recording medium or storage device which is readable by a computer in which at least one amino acid 
sequence information selected from SEQ ID NOS:3502 to 7001 or function information based on the amino acid 
sequence is recorded, and is usable in the system of (25) or (29) or the method of (26) or (30). 

(37) The recording medium or storage device according to 

(35) or (36), which is a computer readable recording medium selected from the group consisting of a floppy disc, 
a hard disc, a magnetic tape, a random access memory (RAM), a read only memory (ROM), a magneto-optic disc 
(MO), CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM and DVD-RW. 

(38) A polypeptide having a homoserine dehydrogenase activity, comprising an amino acid sequence in which the 
Val residue at the 59th in the amino acid sequence of homoserine dehydrogenase derived from a coryneform 
bacterium is replaced with an amino acid residue other than a Val residue. 

(39) A polypeptide comprising an amino acid sequence in which the Val residue at the 59th position in the amino 
acid sequence as represented by SEQ ID NO:6952 is replaced with an amino acid residue other than a Val residue. 

(40) The polypeptide according to (38) or (39), wherein the Val residue at the 59th position is replaced with an Ala 
residue. 

(41) A polypeptide having pyruvate carboxylase activity, comprising an amino acid sequence in which the Pro 
residue at the 458th position in the amino acid sequence of pyruvate carboxylase derived from a coryneform 
bacterium is replaced with an amino acid residue other than a Pro residue. 

(42) A polypeptide comprising an amino acid sequence in which the Pro residue at the 45Bth position in the amino 
acid sequence represented by SEQ ID NO:4265 is replaced with an amino acid residue other than a Pro residue. 

(43) The polypeptide according to (41 ) or (42), wherein the Pro residue at the 458th position is replaced with a Ser 
residue. 

(44) The polypeptide according to any one of (38) to (43), which is derived from Corynebacterium giutamicum. 

(45) A DNA encoding the polypeptide of any one of (38) to (44). 

(46) A recombinant DNA comprising the DNA of (45). 

(47) A transformant comprising the recombinant DNA of (46). 

(48) A transformant comprising in Its chromosome the DNA of (45). 

(49) The transformant according to (47) or (48), which Is derived from a coryneform bacterium. 

(50) The transformant according to (49), which is derived from Corynebacterium giutamicum. 

(51) A method for producing L-lysine, comprising: 

culturing the transformant of any one of (47) to (50) In a medium to produce and accumulate L-lysine In the 
medium, and 

recovering the L-lysine from the culture. 

(52) A method for breeding a coryneform bacterium using the nucleotide sequence information represented by 
SEQ ID NOS:1 to 3431 , comprising the following: 

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived a coryneform bacte- 
rium which has been subjected to mutation breeding so as to produce at least one compound selected from 
an amino acid, a nucleic acid, a vitamin , a saccharide, an organic acid, and analogous thereof by a fermentation 
method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431 ; 

(ii) identifying a mutation point present in the production strain based on a result obtained by (i); 

(iii) introducing the mutation point into a coryneform bacterium which is free of the mutation point; and 

(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform 
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bacterium obtained In (III). 

(53) The method according to (52), wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
i^ZEZZ^ (52). wherein the mutation point is a mutagen point reiating to a usefu. mutation 
^A^i^rSKST^ using the nucieotide sequence information represented by 
SEQ ID NOS:1 to 3431, comprising: 

(I) comparing a nucleotide sequence of a genome or gene of a production strain derived a Wjtormb^" 

rium which has been subjected to mutation breeding so as to produce at least one compound r^Mdimn 

a^oadd,anucleicacid.avitamin,asaccharide,an organic acid, and analogousthereof by a fermentation 

method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431 ; 

(ii) identifying a mutation point present in the production strain based on a result obtain by (.), 

rim deletino a mutation point from a coryneform bacterium having the mutation point; and 

($ ^SSS^SSSm, by the fermentation method of the compound selected in (.) of the coryneform 

bacterium obtained in (Hi). 

(56) The method according to (55), wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 

eXmZdi^^ 

^TSSflor breeding a coryneform bacterium using the nucleotide sequence information represented by 
SEQ ID NOS:2 to 3431 , comprising the following: 

(i) identifying an isozyme relating to biosynthesis of at least one compound sel J^d ft™ i an, 

nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof, based on the nucleotide se- 
quence information represented by SEQ ID NOS:2 to 3431 ; 

(ii) classifying the isozyme identified in (0 into an isozyme having the same activity; 

(Hi) mutating all genes encoding the isozyme having the same corvneform ba<> 

(iv) examining productivity by a fermentation method of the compound selected in (,) of the coryneform bac- 
terium which have been transformed with the gene obtained in (m). 

(59) A method for breeding a coryneform bacterium using the nucleotide sequence information represented by 
SEQ ID NOS:2 to 3431 , comprising the following: 

m arranaina a function Information of an open reading frame (ORF) represented by SEQ ID NOS:2 to 3431 ; 

£3Lng an unknown biosynthesis pathway or signal transmission P^^^T^^Z 
in combination with information relating known biosynthesis pathway or signal transmission pathway of a co 

(M wrpari^hTp'athway explicated in (ill) with a biosynthesis pathway of a target useful product; and 
v) transgeneucally varying a coryneform bacterium based on the nucleotide sequence '"formation to erthe 
Senothen a pathway which is judged to be important in the biosynthesis of the target useful Product in (iv) or 
2 a ^S juaged not to be important in the biosynthesis of the target useful product In (iv). 

(60) A coryneform bacterium, bred by the method of any one of (52) to (59). r>~~~**~ 

(61) The coryneform bacterium according to (60), which fe a mteroorganism belonging to the genus Corynebac 
terium, the genus Brevibacterium, or the genus Microbactenum. corvnebac- 

(62) The conform bacterium according to (61), wherein the microorganism ^belonging to ^he g 

terium is selected from the group consisting of Corynebacteriumglutamicum, Corynebactenum acetoa adophdum. 
C^ZeblctriunVacetoglutamLm, Corynebactenum callunae. Corynebacterium hercuhs, Corynebactenum » 

(63) A method for producing at least one compound selected from an amino acid, a nucleic acid, a vitamin, a 
saccharide, an organic acid and an analogue thereof, comprising: 

culturing a coryneform bacterium of any one of (60) to (62) in a medium to produce and accumulate at least 
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one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and 
analogues thereof ; 

recovering the compound from the culture. 

(64) The method according to (63), wherein the compound is L-lyslne. 

(65) A method for identifying a protein relating to useful mutation based on proteome analysis, comprising the 
following: 

(i) preparing 

a protein derived from a bacterium of a production strain of a coryneform bacterium which has been sub- 
jected to mutation breeding by a fermentation process so as to produce at least one compound selected 
from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof, and 
a protein derived from a bacterium of a parent strain of the production strain; 

(ii) separating the proteins prepared in (I) by two dimensional electrophoresis; 

(Hi) detecting the separated proteins, and comparing an expression amount of the protein derived from the 
production strain with that derived from the parent strain; 

(fv) treating the protein showing different expression amounts as a result of the comparison with a peptidase 
to extract peptide fragments; 

(v) analyzing amino acid sequences of the peptide fragments obtained in (iv); and 

(vi) comparing the amino acid sequences obtained in (v) with the amino acid sequence represented by SEQ 
ID NOS:3602 to 7001 to Identifying the protein having the amino acid sequences. 

As used herein, the term "proteome", which is a coined word by combining "protein" with "genome", refers to 
a method for examining of a gene at the polypeptide level. 

(66) The method according to (65), wherein the coryneform bacterium is a microorganism belonging to the genus 
Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

(67) The method according to (66), wherein the microorganism belonging to the genus Corynebacterium Is selected 
from the group consisting of Corynebacterium giutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetoglutamicum, Corynebacterium catlunae, corynebacterium herculis, Corynebacterium lilium Corynebacterium 
melassecola, Corynebacterium thermoaminogenes t and Corynebacterium ammoniagenes. 

(68) A biologically pure culture of Corynebacterium giutamicum AHP-3 (FERM BP-7382). 

[0018] The present invention will be described below in more detail, based on the determination of the full nucleotide 
sequence of coryneform bacteria. 

1 . Determination of full nucleotide sequence of coryneform bacteria 

[0019] The term "coTyrieform bacteria" as used herein means a microorganism belonging to the genus Corynebac- 
terium, the genus Brevibacterium or the genus Microbacterium as defined In Bergeys Manual of Determinative Bacte- 
riology, &. 599 (197 '4). 

[0020] Examples incl ude Corynebacterium acetoacidophilum, Corynebacterium acetoglutamicum, Corynebacterium 
caiiunae, Corynebacterium giutamicum, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium melas- 
secola, Corynebacterium thermoaminogenes, Brevibacterium saccharotyticum, Brevibacterium immariophilum, Brevi- 
bacterium roseum, Brevibacterium thiogenitaiis, Microbacterium ammoniaphiium, and the like. 
[0021] Specific examples include Corynebacterium acetoacidophilum ATCC 13870, Corynebacterium acetoglutami- 
cum ATCC 15806, Corynebacterium caiiunae ATCC 15991, Corynebacterium giutamicum ATCC 13032, Corynebac- 
terium giutamicum ATCC 1 3060, Corynebacterium giutamicum ATCC 1 3826 (prior genus and species: Brevibacterium 
ffavum, or Corynebacterium lactofermentum), Corynebacterium giutamicum ATCC 14020 (prior genus and species: 
Brevibacterium divaricatum), Corynebacterium giutamicum ATCC 13869 (prior genus and species: Brevibacterium 
lactofermentum), Corynebacterium herculis ATCC 13868, Corynebacterium lilium ATCC 15990, Corynebacterium 
melassecola ATCC 17965, Corynebacterium thermoaminogenes FERM '9244, Brevibacterium saccharotyticum ATCC 
14066, Brevibacterium immariophilum ATCC 14068, Brevibacterium roseum ATCC 13825, Brevibacterium thiogenitaiis 
ATCC 1 9240, Microbacterium ammoniaphiium ATCC 1 5354, and the like. 
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(1) Preparation of genome DNA of coryneform bacteria 

[0022] Coryneform bacteria can be cultured by a conventional method. 

[0023] Any of a natural medium and a synthetic medium can be used, so long as it is a medium suitable for efficient 
culturing of the microorganism, and it contains a carbon source, a nitrogen source, an inorganic salt, and the like which 
can be assimilated by the microorganism. 

[0024] In Corynebacterium glutamicum, for example, a BY medium (7 g/l meat extract, 1 0 g/l peptone, 3 g/l sodium 
chloride, 5 g/l yeast extract, pH 7.2) containing 1% of glycine and the like can be used. The culturing is earned out at 
25 to 35*C overnight. . 
[0025] After the completion of the culture, the cells are recovered from the culture by centnfugation. The resulting 

cells are washed with a washing solution. 

[0026] Examples of the washing solution include STE buffer (1 0.3% sucrose, 25 mmol/l Tris hydrochloride, 25 mmol/ 
I ethyienediaminetetraacetic acid (hereinafter referred to as "EDTA"), pH 8.0), and the like. 

[0027] Genome DNA can be obtained from the washed cells according to a conventional method for obtaining ge- 
nome DNA, namely, lysing the cell wall of the cells using a lysozyme and a surfactant (SDS, etc.), eliminating proteins 
and the like using a phenol solution and a phenol/chloroform solution, and then precipitating the genome DNA with 
ethanol or the like. Specifically, the following method can be illustrated. 

[0028] The washed cells are suspended in a washing solution containing 5 to 20 mg/l lysozyme. After shaking, 5 to 
20% SDS is added to lyse the cells. In usual, shaking is gently performed at 25 to 40°C for 30 minutes to 2 hours. After 
shaking, the suspension is maintained at 60 to 70°C for 5 to 1 5 minutes for the lysis. 

[0029] After the lysis, the suspension is cooled to ordinary temperature, and 5 to 20 ml of Tris-neutralized phenol is 
added thereto, followed by gently shaking at room temperature for 15 to 45 minutes. 

[0030] After shaking, centrif ugation (1 5,000 x g, 20 minutes, 20°C) is carried out to fractionate the aqueous layer. 
[0031] After performing extraction with phenol/chloroform and extraction with chloroform (twice) In the same manner, 
3 mol/l sodium acetate solution (pH 5.2) and isopropanol are added to the aqueous layer at 1/10 times volume and 2 
times volume, of the aqueous layer, respectively, followed by gently stirring to precipitate the genome DNA. 
[0032] The genome DNA is dissolved again in a buffer containing 0.01 to 0.04 mg/ml RNase. As an example of the 
buffer TE buffer (10 mmol/l Tris hydrochloride, 1 mol/l EDTA, pH 8.0) can be used. After dissolving, the resultant 
solution is maintained at 25 to 40°C for 20 to 50 minutes and then extracted successively with phenol, phenol/chloroform 
and chloroform as in the above case. 

[0033] After the extraction, isopropanol precipitation is carried out and the resulting DNA precipitate is washed with 
70% ethanol, followed by air drying, and then dissolved in TE buffer to obtain a genome DNA solution. 

(2) Production of shotgun library 

[0034] A method for produce a genome DNA library using the genome DNA of the coryneform bacteria prepared in 
the above (1) include a method described in Molecular Cloning, A laboratory Manual, Second Edition (1989) (hereinafter 
referred to as "Molecular Cloning, 2nd ed."). in particular, the following method can be exemplified to prepare a genome 
DNA library appropriately usable in determining the full nucleotide sequence by the shotgun method. 
[0035] To 0 01 mg of the genome DNA of the coryneform bacteria prepared In the above (1) , a buffer, such as TE 
buffer or the like, is added to give a total volume of 0.4 ml. Then, the genome DNA is digested into fragments of 1 to 
10 kb with a sonicator (Yamato Powersonic Model 50). The treatment with the sonicator is performed at an output of 
20 continuously for 5 seconds. 

[0036] The resulting genome DNA fragments are blunt-ended using DNA blunting kit (manufactured by TakaraShuzo) 

or the like. , , , 

[0037] The blunt-ended genome fragments are fractionated by agarose gel or poly aery lam ide gel electrophoresis 

and genome fragments of 1 to 2 kb are cut out from the gel. 

[0038] To the gel, 0.2 to 0.5 ml of a buffer for eluting DNA, such as MG elution buffer (0.5 mol/l ammonium acetate, 
10 mmol/l magnesium acetate, 1 mmol/l EDTA, 0.1% SDS) or the like, is added, followed by shaking at 25 to 40«C 
overnighttoeluteDNA. 

[0039] The resulting DNA eluate is treated with phenol/chloroform and then precipitated with ethanol to obtain a 
genome library insert. 

[0040] This insert is ligated into a suitable vector, such as pUC1 8 SmaUSAP (manufactured by Amersham Pharmacia 
Biotech) or the like, using T4 ligase (manufactured by Takara Shuzo) or the like. The ligation can be carried out by 
allowing a mixture to stand at 10 to 20°C for 20 to 50 hours, 

[0041] The resulting ligation product is precipitated with ethanol and dissolved in 5 to 20 uJ of TE buffer. 

[0042] Escherichia coli is transformed in accordance with a conventional method using 0.5 to 2 uJ of the ligation 

solution Examples of the transformation method include the electroporation method using ELECTRO MAX DHIOB 
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(manufactured by Life Technologies) for Escherichia co/L The electrop oration method can be carried out under the 
conditions as described In the manufacturer's Instructions. 

[0043] The transformed Escherichia coii Is spread on a suitable selection medium containing agar, for example, LB 
plate medium containing 10 to 100 mg/l amplcillin (LB medium (10 g/l bactotrypton, 6 g/l yeast extract, 10 g/l sodium 
chloride, pH 7.0) containing 1.6% of agar) when pUC18 is used as the cloning vector, and cultured therein. 
[0044] The transformant can be obtained as colonies formed on the plate medium. In this step, ft is possible to select 
the transformant having the recombinant DNA containing the genome DNA as white colonies by adding X-gal and 
IPTG (isopropyl-p-thiogalactopyranoside) to the plate medium. 

[0045] The transformant is allowed to stand for culturing in a 96-well titer plate to which 0.05 ml of the LB medium 
containing 0.1 mg/ml of ampicillin has been added In each well. The resulting culture can be used in an experiment of 
(4) described below. Also, the culture solution can be stored at -80*C by adding 0.05 ml per well of the LB medium 
containing 20% glycerol to the culture solution, followed by mixing, and the stored culture solution can be used at anv " 
time. 

(3) Production of cosmid library 

[0046] The genome DNA (0.1 mg) of the coryneform bacteria prepared In the above (1) is partially digested with a 
restriction enzyme, such as Sau3AI or the like, and then uitracentrifuged (26,000 rpm, 18 hours, 20°C) under a 10 to 
40% sucrose density gradient using a 1 0% sucrose buffer (1 mol/l Nad, 20 mmol/l Tris hydrochloride, 5 mmol/l EDTA, 
1 0% sucrose, pH 8.0) and a 40% sucrose buffer (elevating the concentration of the 1 0% sucrose buffer to 40%). 
[0047] After the centrlfugatlon, the thus separated solution Is fractionated into tubes in 1 ml per each tube. After 
confirming the DNA fragment size of each fraction by agarose gel electrophoresis, a fraction rich in DNA fragments of 
about 40 kb is precipitated with ethanol. 

[0048] The resulting DNA fragment is ligated to a cosmid vector having a cohesive end which can be ligated to the 
fragment. When the genome DNA is partially digested with Sau3AI, the partially digested product can be ligated to, 
for example, the Ba/nHI site of superCosI (manufactured by Stratagene) in accordance with the manufacture's instruc- 
tions. 

[0049] The resulting ligation product is packaged using a packaging extract which can be prepared by a method 
described in Molecular Cloning, 2nd ed. and then used in transforming Escherichia co/L More specifically, the ligation 
product is packaged using, for example, a commercially available packaging extract, Gigapack III Gold Packaging 
Extract (manufactured by Stratagene) in accordance with the manufacture's instructions and then introduced into Es- 
cherichia coii XL-1 -BlueMR (manufactured by Stratagene) or the like. 

[0050] The thus transformed Escherichia coii is spread on an LB plate medium containing ampicillin, and cultured 
therein. 

[0051] The transformant can be obtained as colonies formed on the plate medium. 

[0052] The transformant is subjected to standing culture in a 96-well titer plate to which 0.05 ml of the LB medium 
containing 0.1 mg/ml ampicillin has been added. 

[0053] The resulting culture can be employed in an experiment of (4) described below. Also, the culture solution can 
be stored at -80°C by adding 0.05 ml per well of the LB medium containing 20% glycerol to the culture solution, followed 
by mixing, and the stored culture solution can be used at any time. 

(4) Determination of nucleotide sequence 
(4-1) Preparation of template 

[0054] The full nucleotide sequence of genome DNA of coryneform bacteria can be determined basically according 
to the whole genome shotgun method (Science, 269: 496-51 2 (1 995)). 

[0055] The template used in the whole genome shotgun method can be prepared by PCR using the library prepared 

in the above (2) (DNA Research, 5: 1 -9 (1 998)). 

[0056] Specifically, the template can be prepared as follows. 

[0057] The clone derived from the whole genome shotgun library is inoculated by using a replicator (manufactured 
by G EN ETIX) into each well of a 96-well plate to which 0.08 ml per well of the LB medium containing 0.1 mg/ml ampicillin 
has been added, followed by stationarily culturing at 37°C overnight. 

[0058] Next, the culture solution is transported, using a copy plate (manufactured by Tokken), into each well of a 
96-well reaction plate (manufactured by PE Biosystems) to which 0.025 ml per well of a PCR reaction solution has 
been added using TaKaRa Ex Taq (manufactured by Takara Shuzo). Then, PCR is carried out in accordance with the 
protocol by Makino et al. (DNA Research, 5: 1-9 (1998)) using GeneAmp PCR System 9700 (manufactured by PE 
Biosystems) to amplify the inserted fragments. 
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[0059] The excessive primers and nucleotides are eliminated using a kit for purifying a PCR product, and the product 
is used as the template In the sequencing reaction. i-*- 

0060] It is also possible to determine the nucleotide sequence using a double-stranded DNA plasmid as a template. 
[0061] The double-stranded DNA plasmid used as the template can be obtained by the Rowing . ™thod 

0062 Theclonederivedfromthewholegen^eshotgunlibrarylslnoculatedlrtoeachv^ll^ 
to which 1 5 ml per well of a 2 x YT medium (1 6 g/l bactotrypton, 1 0 g/l yeast extract. S g/l sodium chlonde, pH 7.0) 
containing 0.05 mg/ml ampicillin has been added, followed by culturing under shaking at 37*C overnight 
S>63] The double-stranded DNA plasmid can be prepared from the culture solut,on using an automatic , ptamU 

preparing machine KURABO PI-50 (manufactured by Kurabo Industries), a mult.screen (manufactured by M.ll.pore) 

or the like, according to each protocol. 

[0064] To purify the plasmid. Biomek 2000 manufactured by Beckman Coulter and the like can be used 

[0065] The resulting purified double-stranded DNA plasmid is dissolved in water to give a concentrate of about 0.1 

mg/ml. Then, it can be used as the template in sequencing. 

(4-2) Sequencing reaction 

[0066] The sequencing reaction can be carried out according to a commercially available sequence kit or the like. A 
specific method is exemplified below. / _„„.,.„^»,, m H 
[0067] To 6 ill of a solution of ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction KM £»«MMund 
Ly PE Biosystems). 1 to 2 pmo. of an M1S regular direction primer (M13-21) or an M1 3 reverse Mp^ 
(MI3REV) (DM4 Research, S: 1-9 (1998)) and 50 to 200 ng of the template prepared in the above (4-1) (the PCR 
product or plasmid) to give 10 uJ of a sequencing reaction solution. j#.^»«a_ 
[0068] Adyetermlnatorsequendng reason (35 to M 
PCRSystem9700(manufacturedbyPEBiosystem^ 

with a commercially available kit. for example, the manufacture's Instructions attached with ABI PRISM Big Dye Ter- 
minator Cycle Sequencing Ready Reaction Kit. #— 
[0069] The sample can be purified using a commercially available product, such as Multi Screen HV plate (manu- 
factured by Millipore) or the like, according to the manufacture's instructions. -ru-rtrtoH 
[0070] The thus purified reaction product is precipitated wrth ethanol, dried and then used for the analysis. The dned 
reaction product can be stored in the dark at -30"C and the stored reaction product can be used at any time. 
[0071] The dried reaction product can be analyzed using a commercially available sequencer and an analyzer ac- 
eordina to the manufacture's instructions. M a , 

Examples of the commercially available sequencer include ABI PRISM 377 DNA Sequencer (manufactured 
by PE Biosystems). Example of the analyzer include ABI PRISM 3700 DNAAnalyzerfmanufacturedby PE Biosystems). 

(5) Assembly 

[0073] A software, such as phred (The University of Washington) or the like, can be use d as base call for use in 
analyzing the sequence information obtained in the above (4). A software, such as Cr ° ss - Match ^ 
Washington) or SPS Cross_Match (manufactured by Southwest Parallel Software) or the like, can be used to mask 

the vector sequence information. . . . . . 

[0074] For the assembly, a software, such as phrap (The University of Washington). SPS phrap (manufactured by 
Southwest Parallel Software) or the like, can be used. M o^i»t««h a nHth« 

[0075] in the above, analysis and output of the results thereof, a computer such as UNIX. PC. Macintosh, and the 

0076]" "con^obtained by the assembly can be analyzed using a graphical editor such as consed (The University 

IttaloposS to perform a series of the operations from the base call to the assembly in a lump using a 
script phredPhrap attached to the consed. 

[0078] As used herein, software will be understood to also be referred to as a comparator. 
(6) Determination of nucleotide sequence in gap part 

[0079] Each of the cosmids in the cosmid library constructed in the above <3) is prepared in the same manner as in 
he preparation of the double-stranded DNA plasmid described in the above (4-1). The nucleotide ^ u <^»""* 
of the insert fragment of the cosmid is determined using a commercially available kit, such as ABI PRISM _ BigDye 
Terminator Cycle Sequencing Ready Reaction Kit (manufactured by PE Biosystems) accord.ng to the manufactures 
instructions. 
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[0080] About 800 cosmid clones are sequenced at both ends of the inserted fragment to detect a nucleotide sequence 
in the contlg derived from the shotgun sequencing obtained In (5) which is coincident with the sequence. Thus, the 
chain linkage between respective cosmid clones and respective contlgs are clarified, and mutual alignment is carried 
out. Furthermore, the results are compared with known physical maps to map the cosmids and the contlgs. In case of 
5 Cotynebacterium glutamlcum ATCC 1 3032, a physical map of Mol. Gen. Genet, 252: 255-265 (1 996) can be used. 
[0081] The sequence in the region which cannot be covered with the contigs (gap part) can be determined by the 
following method. 

[0082] Clones containing sequences positioned at the ends of the contigs are selected. Among these, a clone wherein 
only one end of the inserted fragment has been determined is selected and the sequence at the opposite end of the 
10 inserted fragment is determined. 

[0083] A shotgun library clone or a cosmid clone derived therefrom containing the sequences at the respective ends 
of the inserted fragments in the two contigs is identified and the full nucleotide sequence of the inserted fragment of 
the clone is determined. 

[0084] According to this method, the nucleotide sequence of the gap part can be determined. 
15 [0085] When no shotgun library clone or cosmid clone covering the gap part is available, primers complementary to 

the end sequences of the two different contigs are prepared and the DNA fragment in the gap part Is amplified. Then, 

sequencing is performed by the primer walking method using the amplified DNA fragment as a template or by the 

shotgun method in which the sequence of a shotgun clone prepared from the amplified DNA fragment is determined. 

Thus, the nucleotide sequence of the above-described region can be determined. 
20 [0086] In a region showing a low sequence accuracy, primers are synthesized using AUTOFINISH function and 

NAVIGATING function of consed (The University of Washington), and the sequence is determined by the primer walking 

method to improve the sequence accuracy. 

[0087] Examples of the thus determined nucleotide sequence of the full genome Include the full nucleotide sequence 
of genome of Corynebacterium glutamlcum ATCC 1 3032 represented by SEQ ID NO:1 . 

25 

(7) Determination of nucleotide sequence of microorganism genome DNA using the nucleotide sequence represented 
by SEQ ID NO:1 

[0088] A nucleotide sequence of a polynucleotide having a homology of 80% or more with the full nucleotide sequence 
30 of Corynebacterium glutamicum ATCC 13032 represented by SEQ ID NO:1 as determined above can also be deter- 
mined using the nucleotide sequence represented by SEQ ID NO:1, and the polynucleotide having a nucleotide se- 
quence having a homology of 80% or more with the nucleotide sequence represented by SEQ ID NO:1 of the present 
invention is within the scope of the present invention. The term "polynucleotide having a nucleotide sequence having 
a homology of 80% or more with the nucleotide sequence represented by SEQ' ID NO:1 of the present invention- is a 
35 polynucleotide in which a full nucleotide sequence of the chromosome DNA can be determined using as a primer an 
oligonucleotide composed of continuous 5 to 50 nucleotides in the nucleotide sequence represented by SEQ ID NO: 

1 , for example, according to PCR using the chromosome DNA as a template. A particularly preferred primer in deter- 
mination of the full nucleotide sequence Is an oligonucleotide having nucleotide sequences which are positioned at 
the interval of about 300 to 500 bp, and among such oligonucleotides, an oligonucleotide having a nucleotide sequence 

40 selected from DNAs encodirig^a proteln relating to a main metabolic pathway is particularly preferred. The polynucle- 
otide in which the full nucleotide sequence of the chromosome DNA can be determined using the oligonucleotide 
includes polynucleotides constituting a chromosome DNA derived from a microorganism belonging to coryneform bac- 
teria. Such a polynucleotide is preferably a polynucleotide constituting chromosome DNA derived from a microorganism 
belonging to the genus Corynebacterium, more preferably a polynucleotide constituting a chromosome DNA of Co- 

45 rynebacterium glutamicum. 

2. Identification of ORF (open reading frame) and expression regulatory fragment and determination of the function of 
ORF 

so [0089] Based on the full nucleotide sequence data of the genome derived from coryneform bacteria determined in 
the above item 1 , an ORF and an expression modulating fragment can be identified. Furthermore, the function of the 
thus determined ORF can be determined. 

[0090] The ORF means a continuous region in the nucleotide sequence of mRNA which can be translated as an 
amino acid sequence to mature tb~a protein. A region of the DNA coding for the ORF of mRNA is also called ORF. 
55 [0091] The expression modulating fragment (hereinafter referred to as M EMP) is used herein to define a series of 
polynucleotide fragments which modulate the expression of the ORF or another sequence ligated operatably thereto. 
The expression "modulate the expression of a sequence ligated operatably* is used herein to refer to changes in the 
expression of a sequence due to the presence of the EMF. Examples of the EMF include a promoter, an operator, an 
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bacteria, an EMF is usually present in an ' nt ^*"* *X*h, * esent in an intergenic segment of 10 nucleotides or 
200 nucleotides in length). Accordingly, an EMF to f requentty 'P^ntin an ^ | MF encesasatarge t 

Pharmacia Biotech). nucleotide seauence composed of 6 or more nude- 

l0 0921 The term target sequence" Is used ^J^^^T^SSSb sequence encoding this amino 
otides. an amino acid sequence composed of 2 " ^ appears at random in a data base at 

^^^^ 

otide residues. . , t fer t0 a sequence or a combination of 

[0093] The term "target structural motrf" or "target motrf" s e ba.1. of the threedimensional structure 

or expected as achieving high expression (for ^exampie, Mri £™J^ G °™ ^ lnduced b y acetic acid: 
Z467S3) or a gene showing a deseed hWriM ^^'^gUentwfth the full genome nucleotide 

amino acid translation sequences. „„™„ a f nm , bacteria can be identified by detenmining the 

[0103] Thus, a great number of novel genes denved from c0 ^ e £^^ ^ p y reS ent invention. 

iSS, hi*., mtawgmlr.. of M IdenUtteO 9 er». .re «UMV „«*». 
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[0104] Moreover, the characteristics of respective microorganisms can be clarified by classifying the functions thus 
determined. As a result, valuable information in breeding is obtained. 

[0105] Furthermore, from the ORF information derived from coryneform bacteria, the ORF corresponding to the 
microorganism is prepared and obtained according to the general method as disclosed in Molecular Cloning, 2nd ed. 

s or the like. Specifically, an oligonucleotide having a nucleotide sequence adjacent to the ORF is synthesized, and the 
ORF can be isolated and obtained using the oligonucleotide as a primer and a chromosome DNA derived from co- 
ryneform bacteria as a template according to the general PCR cloning technique. Thus obtained ORF sequences 
include polynucleotides comprising the nucleotide sequence represented by any one of SEQ ID NOS:2 to 3501. 
[0106] The ORF or primer can be prepared using a polypeptide synthesizer based on the above sequence informa- 

io tion. 

[0107] Examples of the polynucleotide of the present invention include a polynucleotide containing the nucleotide 
sequence of the ORF obtained in the above, and a polynucleotide which hybridizes with the polynucleotide under 
stringent conditions. 

[0108] The polynucleotide of the present invention can be a single-stranded DNA, a double-stranded DNA and a 

15 single-stranded RNA, though it is not limited thereto. 

[01 09] The polynucleotide which hybridizes with the polynucleotide containing the nucleotide sequence of the ORF 
obtained in the above under stringent conditions includes a degenerated mutant of the ORF. A degenerated mutant is 
a polynucleotide fragment having a nucleotide sequence which is different from the sequence of the ORF of the present 
invention which encodes the same amino acid sequence by degeneracy of a gene code. 

20 [0110] Specific examples include a polynucleotide comprising the nucleotide sequence represented by any one of 
SEQ ID NOS:2 to 3431, and a polynucleotide which hybridizes with the polynucleotide under stringent conditions. 
[01 11] A polynucleotide which hybridizes under stringent conditions is a polynucleotide obtained by colony hybridi- 
zation, plaque hybridization, Southern blot hybridization or the like using, as a probe, the polynucleotide having the 
nucleotide sequence of the ORF identified in the above. Specific examples include a polynucleotide which can be 

25 identified by carrying out hybridization at 65°C in the presence of 0.7-1 .0 M NaCI using a filter on which a polynucleotide 
prepared from colonies or plaques is immobilized, and then washing the filter with 0.1x to 2x SSC solution (the com- 
position of Ix SSC contains 150 mM sodium chloride and 15 mM sodium citrate) at 65°C. 

[01 1 2] The hybridization can be carried out in accordance with known methods described in, for example, Molecular 
Cloning, 2nd ed., Current Protocols in Molecular Biology, DNA Cloning 1: Core Techniques, A Practical Approach, 

30 Second Edition, Oxford University (1 995) or the like. Specific examples of the polynucleotide which can be hybridized 
include a DNA having a homology of 60% or more, preferably 80% or more, and particularly preferably 95% or more, 
with the nucleotide sequence represented by any one of SEQ ID NO:2 to 3431 when calculated using default (initial 
setting) parameters of a homology searching software, such as BLAST, FASTA, Smith-Waterman or the like. 
[01 13] Also, the polynucleotide of the present invention includes a polynucleotide encoding a polypeptide comprising 

35 the amino acid sequence represented by any one of SEQ ID NOS:3502 to 6931 and a polynucleotide which hybridizes 
with the polynucleotide under stringent conditions. 

[0114] Furthermore, the polynucleotide of the present invention includes a polynucleotide which is present in the 5* 
upstream or 3' downstream region of a polynucleotide comprising the nucleotide sequence of any one of SEQ ID NOS: 
2 to 3431 in a polynucleotide comprising the nucleotide sequence represented by SEQ ID NO:1 , and has an activity 
of regulating an expression of a polypeptide encoded by the polynucleotide. Specific examples of the polynucleotide 
having an activity of regulating an expression of a polypeptide encoded by the polynucleotide includes a polynucleotide 
encoding the above described EMF, such as a promoter, an operator, an enhancer, a silencer, a ribosome-blnding 
sequence, a transcriptional termination sequence, and the like. 

[0115] The primer used for obtaining the ORF according to the above PCR cloning technique includes an olfgonu- 
45 cleotide comprising a sequence which is the same as a sequence of 1 0 to 200 continuous nucleotides in the nucleotide 
sequence of the ORF and an adjacent region or an oligonucleotide comprising a sequence which is complementary 
to the oligonucleotide. Specific examples include an oligonucleotide comprising a sequence which Is the same as a 
sequence of 10 to 200 continuous nucleotides of the nucleotide sequence represented by any one of SEQ ID NOS:1 
to 3431 , and an oligonucleotide comprising a sequence complementary to the oligonucleotide comprising a sequence 
50 of at least 1 0 to 20 continuous nucleotide of any one of SEQ ID NOS: 1 to 3431 . When the primers are used as a sense 
primer and an antisense primer, the above-described oligonucleotides in which melting temperature (T m ) and the 
number of nucleotides are not significantly different from each other are preferred. 

[01 16] The oligonucleotide of the present invention includes an oligonucleotide comprising a sequence which is the 
same as 10 to 200 continuous nucleotides of the nucleotide sequence represented by any one of SEQ ID NOS:1 to 
55 3431 or an oligonucleotide comprising a sequence complementary to the oligonucleotide. 

[0117] Also, analogues of these oligonucleotides (hereinafter also referred to as "analogous oligonucleotides") are 
also provided by the present invention and are useful in the methods described herein. 

[0118] Examples of the analogous oligonucleotides include analogous oligonucleotides in which a phosphodiester 
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bond in an oligonucleotide is converted to a phosphorothioate bond, analogous oligonucleotides in which a phosphodl- 
ester bond in an oligonucleotide is converted to an N3'-P5' phosphoamidate bond, analogous oligonucleotides in which 
ribose and a phosphodiester bond in an oligonucleotide is converted to a peptide nucleic acid bond, analogous oligo- 
nucleotides In which uracil in an oligonucleotide is replaced with C-5 propynyluracil, analogous oligonucleotides in 

5 which uracil in an oligonucleotide is replaced with C-5 thiazoluracll, analogous oligonucleotides in which cytosine in 
an oligonucleotide is replaced with C-5 propynylcytosine, analogous oligonucleotides in which cytosine in an oligonu- 
cleotide is replaced with phenoxazine-modified cytosine, analogous oligonucleotides in which ribose in an oligonucle- 
otide is replaced with 2'-0-propylribose, analogous oligonucleotides in which ribose in an oligonucleotide is replaced 
with 2*-methoxyethoxyribose, and the like {Cell Engineering, 1& 1463 (1997)). 

10 [01 1 9] The above oligonucleotides and analogous oligonucleotides of the present invention can be used as probes 
for hybridization and antisense nucleic acids described below in addition to as primers. 

[0120] Examples of a primer for the antisense nucleic acid techniques known in the art include an oligonucleotide 
which hybridizes the oligonucleotide of the present invention under stringent conditions and has an activity regulating 
expression of the polypeptide encoded by the polynucleotide, in addition to the above oligonucleotide. 

15 

3. Determination of isozymes 

[0121] Many mutants of coryneform bacteria which are useful in the production of useful substances, such as amino 
acids, nucleic acids, vitamins, saccharides, organic acids, and the like, are obtained by the present invention. 

20 [0122] However, since the gene sequence data of the microorganism has been, to date, insufficient, useful mutants 
have been obtained by mutagenic techniques using a mutagen, such as nitrosoguanidine (NTG) or the like. 
[0123] Although genes can be mutated randomly by the mutagenic method using the above-described mutagen, all 
genes encoding respective isozymes having similar properties relating to the metabolism of intermediates cannot be 
mutated. In the mutagenic method using a mutagen, genes are mutated randomly. Accordingly, harmful mutations 

25 worsening culture characteristics, such as delay in growth, accelerated foaming, and the like, might be imparted at a 
great frequency, in a random manner. 

[01 24] However, if gene sequence information is available, such as is provided by the present invention, it is possible 
to mutate all of the genes encoding target isozymes. In this case, harmful mutations may be avoided and the target 
mutation can be incorporated. 

30 [0125] Namely, an accurate number and sequence information of the target isozymes in coryneform bacteria can be 
obtained based on the ORF data obtained in the above item 2. By using the sequence information, all of the target 
isozyme genes can be mutated into genes having the desired properties by, for example, the site-specific mutagenesis 
method described in Molecular Cloning, 2nd ed. to obtain useful mutants having elevated productivity of useful sub- 
stances. 

35 

4. Clarification or determination of biosynthesis pathway and signal transmission pathway 

[01 26] Attempts have been made to elucidate biosynthesis pathways and signal transmission pathways in a number 
of organisms, and many findings have been reported. However, there are many unknown aspects of coryneform bac- 
40 teria since a number of genes have not been Identified so far. 

[0127] These unknown points can be clarified by the following method. 

[01 28] The functional information of ORF derived from coryneform bacteria as identified by the method of above item 
2 is arranged. The term "arranged" means that the ORF is classified based on the biosynthesis pathway of a substance 
or the signal transmission pathway to which the ORF belongs using known information according to the functional 
45 information. Next, the arranged ORF sequence information is compared with enzymes on the biosynthesis pathways 
or signal transmission pathways of other known organisms. The resulting information is combined with known data on 
coryneform bacteria. Thus, the biosynthesis pathways and signal transmission pathways in coryneform bacteria, which 
have been unknown so far, can be determined. 

[0129] As a result that these pathways which have been unknown or unclear hitherto are clarified, a useful mutant 
50 for producing a target useful substance can be efficiently obtained. 

[0130] When the thus clarified pathway is judged as important in the synthesis of a useful product, a useful mutant 
can be obtained by selecting a mutant wherein this pathway has been strengthened. Also, when the thus clarified 
pathway is judged as not important in the biosynthesis of the target useful product, a useful mutant can be obtained 
by selecting a mutant wherein the utilization frequency of this pathway is lowered. 

55 

5. Clarification or determination of useful mutation point 

[01 31] Many useful mutants of coryneform bacteria which are suitable for the production of useful substances, such 
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as amino acids, nucleic acids, vitamins, saccharides, organic acids, and the like, have been obtained. However, it Is 
hardly known which mutation point is Imparted to a gene to improve the productivity. 

[0132] However, mutation points contained In production strains can be Identified by comparing desired sequences 
of the genome DNA of the production strains obtained from coryneform bacteria by the mutagenic technique with the 
nucleotide sequences of the corresponding genome DNA and ORF derived from coryneform bacteria determined by 
the methods of the above items 1 and 2 and analyzing them s 

[0133] Moreover, effective mutation points contributing to the production can be easily specified from among these 
mutation points on the basis of known information relating to the metabolic pathways, the metabolic regulatory mech- 
anisms, the structure activity correlation of enzymes, and the like. 

[01 34] When any efficient mutation can be hardly specified based on known data, the mutation points thus identified 
can be introduced into a wild strain of coryneform bacteria or a production strain free of the mutation. Then, it is examined 
whether or not any positive effect can be achieved on the production. 

[0135] For example, by comparing the nucleotide sequence of homoserine dehydrogenase gene horn of a lysine- 
producing B-6 strain of Corynebacterium glutamicum (AppL Microbiol Biotechnol., 32: 269-273 (1989)) with the nu- 
cleotide sequence corresponding to the genome of Corynebacterium glutamicum ATCC 1 3032 according to the present 
invention, a mutation of amino acid replacement in which valine at the 59-position is replaced with alanine (Val59Ala) 
was Identified. A strain obtained by introducing this mutation into the ATCC 13032 strain by the gene replacement 
method can produce lysine, which indicates that this mutation is an effective mutation contributing to the production 
of lysine. 

[0136] Similarly, by comparing the nucleotide sequence of pyruvate carboxylase gene pyc of the B-6 strain with the 
nucleotide sequence corresponding to the ATCC 1 3032 genome, a mutation of amino acid replacement in wh Ich proline 
at the 468-position was replaced with serine (Pro45BSer) was Identified. A strain obtained by introducing this mutation 
into a lysine-producing strain of No. 58 (FERM BP-71 34) of Corynebacterium glutamicum free of this mutation shows 
an improved lysine productivity In comparison with the No. 58 strain, which indicates that this mutation Is an effective 
mutation contributing to the production of lysine. 

[01 37] In addition, a mutation A1 a2 1 3Thr in glucose-6-phosphate dehydrogenase was specified as an effective mu- 
tation relating to the production of lysine by detecting giucose-6-phosphate dehydrogenase gene zwf of the B-6 strain. 
[0138] Furthermore, the lyslne-productivity of Corynebacterium glutamicum was improved by replacing the base at 
the 932-position of aspartokinase gene tysC of the Corynebacterium glutamicum ATCC 13032 genome with cytosine 
to thereby replace threonine at the 31 1 -position by isoleucine, which indicates that this mutation is an effective mutation 
contributing to the production of lysine. 

[01 39] Also, as another method to examine whether or not the identified mutation point is an effective mutation, there 
is a method in which the mutation possessed by the lysine-producing strain is returned to the sequence of a wild type 
strain by the gene replacement method and whether or not it has a negative influence on the lysine productivity. For 
example, when the amino acid replacement mutation Val59Aia possessed by horn of the lysine-producing B-6 strain 
was returned to a wild type amino acid sequence, the lysine productivity was lowered in comparison with the B-6 strain. 
Thus, ft was found that this mutation is an effective mutation contributing to the production of lysine. 
[01 40] Effective mutation points can be more efficiently and comprehensively extracted by combining, if needed, the 
- DNA array analysis or proteome analysis described below. 

6. Method of breeding industrially advantageous production strain 

[0141] it has been a general practice to construct production strains, which are used industrially in the fermentation 
production of the target useful substances, such as amino acids, nucleic acids, vitamins, saccharides, organic acids, 
and the like, by repeating mutagenesis and breeding based on random mutagenesis using mutagens, such as NTG 
or the (ike, and screening. 

[0142] In recent years, many examples of improved production strains have been made through the use of recom- 
binant DNA techniques. In breeding, however, most of the parent production strains to be improved are mutants ob- 
tained by a conventional mutagenic procedure (W. Leuchtenberger, Amino Acids - Technical Production and Use. In: 
Roehr (ed) Biotechnology, second edition, vol. 6, products of primary metabolism. VCH Verlagsgesellschaft mbH, Wein- 
heim, P 465 (1996)). 

[0143] Although mutagenesis methods have largely contributed to the progress of the fermentation industry, they 
suffer from a serious problem of multiple, random introduction of mutations into every part of the chromosome. Since 
many mutations are accumulated in a single chromosome each time a strain is improved, a production strain obtained 
by the random mutation and selecting is generally inferior in properties (for example, showing poor growth, delayed 
consumption of saccharides, and poor resistance to stresses such as temperature and oxygen) to a wild type strain, 
which brings about troubles such as failing to establish a sufficiently elevated productivity, being frequently contami- 
nated with miscellaneous bacteria, requiring troublesome procedures in culture maintenance, and the like, and, in its 
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expression amount and the expression profile thereof can be analyzed. Q ^ rtrH i„n to 

[01 67] The nucleic acid molecule (DNA, RN A) derived from the corynef orm bacteria can be obtained according \lo 
the general method described in Molecular Cloning, 2nd ed. orthe like. mRNA derived from ^°tecteriumglutaml- 
cl can also be obtained by the method of Bormann et al. (Molecular Microbiology, ft 31 7-326 (1 992)) or the lite 
[01 68] Although ribosomal RNA (rRNA) is usually obtained in large excess in addition to the target mRNA, the anal- 
ysis is not seriously disturbed thereby. . k . M ^«„t 
[0169] The resulting nucleic acid molecule derived from coryneform bacteria is labeled. Labeling can be carried out 
according to a method using a fluorescent dye, a method using a radioisotope or the like. avtroot(Wl f rom 

[0170] Specific examples include a labeling method in which psoralen-biotin is crossl.nked with RNA extracted from 
a microorganism and, after hybridization reaction, a fluorescent dye having streptoavidin bound thereto is bound to 
the biotin moiety (Nat. Blotechnol., f ft 45-48 (1998)); a labeling method in which a reverse transition rearttonl 
carried out using RNA extracted from a microorganism as a template and random primers as primers, and dUTP havmg 
a fluorescent dye (for example, Cy3, Cy5) (manufactured by Amersham Pharmacia Biotech) is incorporated into cDNA 
tProc. Natl. Acad. Sd. USA, 9&. 1 2833-38 (1 999)); and the like. m „ toma „ ta ™ tn 

[0171] The labeling specificity can be improved by replacing the random primers by sequences complementary to 
the 3'-end of ORF {J. Bacterid., 181: 6425-40 (1 999)). nanor *i 
[0172] In the hybridization method, the hybridization and subsequent washing can be earned out by the general 
method (Afaf. Bioctechnol., 14: 1675-80 (1996), orthe like). „ t ... , ..^ 

r01731 Subsequently, the hybridization intensity is measured depending on the hybndizatlon amount of the nucleic 
acid molecule used in the labeling. Thus, the mutation point can be identified and the expression amount of the gene 

Sni] °t52 hybndizatlon intensity can be measured by visualizing the fluorescent signal, radioactivity, lumin^ree 
dose, andthe like, using a laser confocal microscope, a CCD camera, a radiation imaging device (for example, STORM 
manufactured by Amersham Pharmacia Biotech), and the like, and then quantifying the thus visualized data. 
[0175] A polynucleotide array on a solid support can also be analyzed and quantified using a commercially available 
apparatus, such as GMS41 8 Array Scanner (manufactured by TakaraShuzo) or the like. 

[0176] The gene expression amount can be analyzed using a commercially available software (for example, ImaGene 
manufactured by Takara Shuzo; Array Gauge manufactured by Fuji Photo Film; ImageQuant manufactured by Amer- 
sham Pharmacia Biotech, orthe like). , . . . 

[0177] A fluctuation in the expression amount of a specific gene can be monitored using a nucleic acid molecule 
obtained in the time course of culture as the nucleic acid molecule derived from coryneform bactena. The culture 

conditions can be optimized by analyzing the fluctuation. 

[0178] The expression profile of the microorganism at the total gene level (namely, which genes among a great 
number of genes encoded by the genome have been expressed and the expression ratio thereof) can be determined 
using a nucleic acid molecule having the sequences of many genes determined from the full genome sequence of the 
microorganism. Thus, the expression amount of the genes determined by the full genome sequence can be analyzed 
and, In its turn, the biological conditions of the microorganism can be recognized as the expression pattern at the full 
gene level. 

(b) Confirmation of the presence of gene homologous to examined gene In coryneform bacteria 

[0179] Whether or not a gene homologous to the examined gene, which is present in an organism other than co- 
ryneform bacteria, is present in coryneform bacteria can be detected using the polynucleotide array prepared in the 

roi°801 (1 This detection can be earned out by a method in which an examined gene which is present in an organism 
other than coryneform bacteria is used instead of the nucleic acid molecule derived from coryneform bacteria used in 
the above identification/analysis method of (1 ). 

8. Recording medium storing full genome nucleotide sequence and ORF data and being readable by a computer and 
methods for using the same 

[0181] The term "recording medium or storage device which is readable by a computer* means a recording medium 
or storage medium which can be directly readout and accessed with a computer. Examples include magnetic recording 
media, such as a floppy disk, a hard disk, a magnetic tape, and the like; optical recording media, such as CD-ROM, 
CD-R CD-RW DVD-ROM DVD-RAM, DVD-RW, and the like; electric recording media, such as RAM, ROM, and tne 
like; and hybrids in these categories (for example, magnetic/optical recording media, such as MO and the like). 
[0182] Instruments for recording or inputting in or on the recording medium or instruments or devices for reading out 
the information in the recording medium can be appropriately selected, depending on the type of the recording medium 



20 



EP 1 108 790 A2 



and the access device utilized. Also, various data processing programs, software, comparator and formats are used 
for recording and utilizing the polynucleotide sequence information or the like, of the present Invention in the recording 
medium. The information can be expressed in the form of a binary file, a text file or an ASCII file formatted with com- 
mercially available software, for example. Moreover, software for accessing the sequence information Is available and 

5 known to one of ordinary skill in the art. 

[0183] Examples of the information to be recorded in the above-described medium include the full genome nucleotide 
sequence information of coryneform bacteria as obtained in the above item 2, the nucleotide sequence information of 
ORF, the amino acid sequence information encoded by the ORF, and the functional information of polynucleotides 
coding for the amino acid sequences. 

10 [01 84] The recording medium or storage device which is readable by a computer according to the present invention 
refers to a medium in which the information of the present invention has been recorded. Examples include recording 
media or storage devices which are readable by a computer storing the nucleotide sequence information represented 
by SEQ ID NOS:1 to 3501 , the amino acid sequence information represented by SEQ ID NOS:3502 to 7001 , the 
functional information of the nucleotide sequences represented by SEQ ID NOS:1 to 3501, the functional information 

is of the amino acid sequences represented by SEQ ID NOS:3502 to 7001 , and the information listed in Table 1 below 
and the like. 

9. System based on a computer using the recording medium of the present invention which Is readable by a computer 

20 [0185] The term "system based on a computer" as used herein refers a system composed of hardware devlce(s), 
software devlce(s), and data recording device(s) which are used for analyzing the data recorded in the recording me- 
dium of the present invention which Is readable by a computer. 

[01 86] The hardware device(s) are, for example, composed of an input unit, a data recording unit, a central processing 
unit and an output unit collectively or individually. 

25 [0187] By the software device(s), the data recorded in the recording medium of the present invention are searched 
or analyzed using the recorded data and the hardware device(s) as described herein. Specifically, the software device 
(s) contain at least one program which acts on or with the system in order to screen, analyze or compare biologically 
meaningful structures or information from the nucleotide sequences, amino acid sequences and the like recorded in 
the recording medium according to the present invention. 

30 [0188] Examples of the software device (s) for identifying ORF and EMF domains include GeneMark (Nuc. Adds. 
Res., 22. 4756-67 (1994)), GeneHacker {Protein, Nucleic Acid and Enzyme, 42: 3001-07 (1997)), Glimmer (The Insti- 
tute of Genomic Research; Nuc. Acids. Res., 26: 544-548 (1 998)) and the like. In the process of using such a software 
device, the default (initial setting) parameters are usually used, although the parameters can be changed, if necessary, 
in a manner known to one of ordinary skill in the art. 

35 [0189] Examples of the software device(s) for identifying a genome domain or a polypeptide domain analogous to 
the target sequence or the target structural motif (homology searching) include FASTA, BLAST, Smith-Waterman, 
GenetyxMac (manufactured by Software Development), GCG Package (manufactured by Genetic Computer Group), 
GenCore (manufactured by Compugen), and the like. In the process of using such a software device, the default (initial 
setting) parameters are usually used, although the parameters can be changed, if necessary, in a manner known to 

40 one of ordinary skill in the art. 

[0190] Such a recording medium storing the full genome sequence data is useful in preparing a polynucleotide array 
by which the expression amount of a gene encoded by the genome DNA of coryneform bacteria and the expression 
profile at the total gene level of the microorganism, namely, which genes among many genes encoded by the genome 
have been expressed and the expression ratio thereof, can be determined. 

45 [0191] The data recording device(s) provided by the present invention are, for example, memory device(s) for re- 
cording the data recorded in the recording medium of the present invention and target sequence or target structural 
motif data, or the like, and a memory accessing devlce(s) for accessing the same. 

[0192] Namely, the system based on a computer according to the present invention comprises the following: 

so (i) a user input device that inputs the information stored in the recording medium of the present invention, and 

target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the information stored in the recording medium of the present invention with the 
target sequence or target structure motif information, recorded by the data storing device of (ii) for screening and 

55 analyzing nucleotide sequence information which is coincident with or analogous to the target sequence or target 

structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 
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[01 93] This system is usable in the methods in items 2 to 5 as described above for searching and analyzing the ORF 
and EMF domains, target sequence, target structural motif, etc, of a coryneform bacterium, searching homologs, 
searching and analyzing isozymes, determining the biosynthesis pathway and the signal transmission pathway, and 
identifying spots which have been found in the proteome analysis. The term "homologs" as used herein includes both 
5 of orthologs and paralogs. 

10. Production of polypeptide using ORF derived from coryneform bacteria 

[0194] The polypeptide of the present invention can be produced using a polynucleotide comprising the ORF obtained 
10 in the above item 2. Specifically, the polypeptide of the present invention can be produced by expressing the polynu- 
cleotide of the present invention or a fragment thereof in a host cell, using the method described in Molecular Cloning, 
2nd ed., Current Protocols in Molecular Biology, and the like, for example, according to the following method. 
[0195] A DNA fragment having a suitable length containing a part encoding the polypeptide is prepared from the full 
length ORF sequence, if necessary. 
is [0196] Also, DNA in which nucleotides in a nucleotide sequence at a part encoding the polypeptide of the present 
invention are replaced to give a codon suitable for expression of the host cell, if necessary. The DNA Is useful for 
efficiently producing the polypeptide of the present invention. 

[0197] A recombinant vector is prepared by inserting the DNA fragment into the downstream of a promoter in a 
suitable expression vector. 
20 [01 98] The recombinant vector is introduced to a host cell suitable for the expression vector. 

[01 99] Any of bacteria, yeasts, animal cells, insect cells, plant cells, and the like can be used as the host cell so long 
as It can be expressed in the gene of interest. 

[0200] Examples of the expression vector include those which can replicate autonomously in the above-described 
host cell or can be Integrated into chromosome and have a promoter at such a position that the DNA encoding the 

25 polypeptide of the present invention can be transcribed. 

[0201] When a procaryote cell, such as a bacterium or the like, is used as the host cell, it is preferred that the 
recombinant vector containing the DNA encoding the polypeptide of the present invention can replicate autonomously 
in the bacterium and is a recombinant vector constituted by, at least a promoter, a ribosome binding sequence, the 
DNA of the present invention and a transcription termination sequence. A promoter controlling gene can also be con- 

30 tained therewith in operable combination. 

[0202] Examples of the expression vectors include a vector plasmid which is replicable in Corynebacterium glutami- 
cum, such as pCGI (Japanese Published Unexamined Patent Application No. 134600/82), pCG2 (Japanese Published 
Unexamined Patent Application No. 35197/83), pCG4 (Japanese Published Unexamined Patent Application No. 
183799/82). pCG11 (Japanese Published Unexamined Patent Application No. 134600/82), pCG116, pCE54 and 

35 pCB1 01 (Japanese Published Unexamined Patent Application No. 1 05999/83), pCE51 , pCE52 and pCE53 (Mol. Gen. 
Genet, 196: 1 75-178 (1 984)), and the like; a vector plasmid which is replicable in Escherichia coii, such as pET3 and 
pET11 (manufactured by Stratagene), pBAD, pThioHis and pTrcHls (manufactured by Invitrogen), pKK223-3 and 
pGEX2T (manufactured by Amersham Pharmacia Biotech), and the like; and pBTrp2, pBTad and pBTac2 (manufac- 
tured by Boehringer-Mannheim Co.), pSE280 (manufactured by Invitrogen), pGEMEX-1 (manufactured by Promega), 

40 pQE-8 (manufactured by QIAGEN), pKYPIO (Japanese Published Unexamined Patent Application No. 110600/83), 
pKYP200 (Agric. Biol. Chem., 48: 669 (1984)), pLSA1 (Agric. Biol. Chem., S3: 277 (1989)), pGEL1 (Proc. Natl. Acad. 
Sci. USA, 82: 4306 (1 985)), pBluescript II SK(-) (manufactured by Stratagene), pTrs30 (prepared from Escherichia coli 
JM109/pTrS30 (FERM BP-5407)), pTrs32 (prepared from Escherichia coli JM109/pTrS32 (FERM BP-5408)), pGHA2 
(prepared from Escherichia coll IGHA2 (FERM B-400), Japanese Published Unexamined Patent Application No. 

45 221 091/85), pGKA2 (prepared from Escherichia coli IGKA2 (FERM BP-6798), Japanese Published Unexamined Patent 
Application No. 221091/85), pTerm2 (U.S. Patents 4,686,191, 4,939,094 and 5,160,735), pSupex, pUB110, pTP5, 
pC1 94 and pEG400 (J. Bacteriol., 1 72: 2392 (1 990)), pGEX (manufactured by Pharmacia), pET system (manufactured 
by Novagen), and the like. 

[0203] Any promoter can be used so long as it can function in the host cell. Examples include promoters derived 
so from Escherichia coli, phage and the like, such as trp promoter (P trp ) , lac promoter, P L promoter, P R promoter, T7 

promoter and the like. Also, artificially designed and modified promoters, such as a promoter in which two Pfrp are 

linked in series (P +rp x2) , tac promoter, /acT7 promoter /efl promoter and the like, can be used. 

[0204] It is preferred to use a plasmid in which the space between Shine-Dalgarno sequence which is the ribosome 

binding sequence and the initiation codon is adjusted to an appropriate distance (for example, 6 to 18 nucleotides). 
55 [0205] The transcription termination sequence is not always necessary for the expression of the DNA of the present 

invention. However, it is preferred to arrange the transcription terminating sequence at just downstream of the structural 

gene. 

[0206] One of ordinary skill in the art will appreciate that the codons of the above-described elements may be opti- 
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mlzed, in a known manner, depending on the host cells and environmental conditions utilized, 
[02071 Examples of the host cell include microorganisms belonging to the genus Escherichia, the genus Serratia, 
the genus Bacillus, the genus Brevibacterfum, the genus Corynebacterium, the genus Mlcrobacterium, the genus Pseu- 
domonas, and the like. Specific examples include Escherichia co//XL1-Blue, Escherichia coli XL2-Blue f Escherichia 
coll DH1 , Escherichia coli MC1000, Escherichia coll KY3276, Escherichia coli W1485, Escherichia coll JM109, Es- 
cherichia co//HB101 , Escherichia coliUo. 49, Escherichia coli W3110, Escherichia co//NY49, Escherichia CO//GI698, 
Escherichia coll TBI, Serratia ficaria, Serratia fonticola, Serratia liquefaciens, Serratia marcescens, Bacillus subtilfs, 
Bacillus amyioliquefaciens, Corynebacterium ammonia genes, Brevibacterium immariophilum ATCC 14068, Brevibac- 
teriumsaccharolyticum ATCC 14066, Corynebacterium glutamicum ATCC 13032, Corynebacterium glutamicum ATCC 
13869, Corynebacterium glutamicum ATCC 14067 (prior genus and species: Brevibacterium flavum), Corynebacterium 
glutamicum ATCC 13869 (prior genus and species: Brevibacterium lactofermentum, or Corynebacterium lactofermen- 
turn), Corynebacterium acetoacidophilum fiTCC 13870, Corynebacterium thermoaminogenes FERM 9244, Microbac- 
terium ammoniaphilum ATCC 15354, Pseudomonas putida, Pseudomonas sp. D-0110, and the like. 
[0208] When Corynebacterium glutamicum or an analogous microorganism is used as a host, an EMF necessary 
for expressing the polypeptide is not always contained in the vector so long as the polynucleotide of the present in- 
vention contains an EMF. When the EMF Is not contained in the polynucleotide, it is necessary to prepare the EMF 
separately and ligate it so as to be in operable combination. Also, when a higher expression amount or specific ex- 
pression regulation is necessary, it is necessary to ligate the EMF corresponding thereto so as to put the EMF In 
operable combination with the polynucleotide. Examples of using an externally ligated EMF are disclosed In Microbi- 
ology, 142. 1297-1309(1996). 

[0209] With regard to the method for the Introduction of the recombinant vector, any method for introducing DIMA into 
the above-described host cells, such as a method in which a calcium ion is used (Proc. Natl. Acad. Sci. USA, 69. 21 1 0 
(1972)), a protoplast method (Japanese Published Unexamined Patent Application No. 2483942/88), the methods 
described in Gene, 1T.AQ7 (1982) and Molecular & General Genetics, 168. 111 (1979) and the like, can be used. 
[0210] When yeast is used as the host cell, examples of the expression vector include pYES2 (manufactured by 
Invitrogen), YEp13 (ATCC 37115), YEp24 (ATCC 37051), YCp50 (ATCC 37419), pHS19, pHS15, and the like. 
[0211] Any promoter can be used so long as it can be expressed in yeast. Examples include a promoter of a gene 
in the glycolytic pathway, such as hexose kinase and the like, PHOS promoter, PGK promoter, GAP promoter, ADH 
promoter, gal 1 promoter, gal 1 0 promoter, a heat shock protein promoter, MF al promoter, CUP 1 promoter, and the like. 
[0212] Examples of the host cell include microorganisms belonging to the genus Saccharomyces, the genus 
Schizosaccharomyces, the genus Kluyveromyces, the genus Trichosporon, the genus Schwanniomyces, the genus 
Pichia, the genus Candida and the like. Specific examples include Saccharomyces cerevisiae, Schizosaccharomyces 
pombe, Kluyveromyces lactis, Trichosporon pullulans, Schwanniomyces alluvius, Candida util'ts and the like. 
[021 3] With regard to the method for the introduction of the recombinant vector, any method for introducing DNA into 
yeast, such as an electroporation method (Methods. EnzymoL, 194: 182 (1990)), a spheroplast method (Proc. Natl. 
Acad. Sci. USA, 75: 1929 (1978)), a lithium acetate method (J. Bacterid., 153: 163 (1983)), a method described in 
Proc. Natl. Acad. Sci. USA, 75: 1 929 (1 978) and the like, can be used. 

[021 4] When animal cells are used as the host cells, examples of the expression vector Include pcDNA3. 1 , pSinRepS 
and pCEP4 (manufactured by Invltorogen), pRev-Tre (manufactured by Clontech), pAxCAwt (manufactured by Takara 
Shuzo), pcDNAI and pcDM8 (manufactured by Funakoshi), pAGE107 (Japanese Published Unexamined Patent Ap- 
plication No. 22979/91; Cytotechnology, 3:133 (1990)), pAS3-3 (Japanese Published Unexamined Patent Application 
No. 227075/90), pcDM8 (Nature, 329: 840 (1987)), pcDNAI/Amp (manufactured by Invitrogen), pREP4 (manufactured 
by Invitrogen), pAGE103 (J. Biochem., 101: 1307 (1987)), pAGE210, and the like. 

[0215] Any promoter can be used so long as it can function in animal cells. Examples Include a promoter of IE 
(immediate early) gene of cytomegalovirus (CMV), an early promoter of SV40, a promoter of retrovirus, a metal- 
lothlonein promoter, a heat shock promoter, SRa promoter, and the like. Also, the enhancer of the IE gene of human 
CMV can be used together with the promoter. 

[0216] Examples of the host cell include human Namalwa cell, monkey COS cell, Chinese hamster CHO cell, 
HST5637 (Japanese Published Unexamined Patent Application No. 299/88), and the like. 

[0217] The method for introduction of the recombinant vector into animal cells is not particularly limited, so long as 
it is the general method for introducing DNA into animal cells, such as an electroporation method (Cytotechnology, 3 
133 (1990)), a calcium phosphate method (Japanese Published Unexamined Patent Application No. 227075/90), a 
lipofection method (Proc. Natl. Acad. Sci. USA, 84, 7413 (1987)), the method described in Virology, 52. 456 (1973), 
and the like. 

[0218] When insect cells are used as the host cells, the polypeptide can be expressed, for example, by the method 
described in Bacurovirus Expression Vectors, A Laboratory Manual, W.H. Freeman and Company, New York (1992), 
Bio/Technology, 6: 47 (1 988), or the like. 

[0219] Specifically, a recombinant gene transfer vector and bacurovirus are simultaneously inserted into insect cells 
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SfEXV^SSSfS-r -« » «ne «M include p«ueBac4. 6 . pVLIOK, »!.«• 
the family Barathra are infected, and the like. ^^w* n ,wtoc S f9 and Sf21 (Bacurovirus Expression 

f actured by Invitrogen) and the like. oK«u«^AQrrih«d recombinant qene transfer vector and the 

S wCSt ce. te are used as the host ce..s. examp.es of expression vector Inc.ude a Ti p.asmid. a tobacco 
M ^CorcTn^ed so .ong as It can be expressed in plant ce„s. Examp.es Mud. 35S promoter of 

or introducing DNA Into plant cells, such as ^^^robactenum ^^^o^e PuWtehea 
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an inducer can be added to the medium, if necessary. 

[0240] For example, Isopropyl-p-D-thiogalactopyranoside (IPTG) or the like can be added to the medium when a 
microorganism transformed with a recombinant vector containing lac promoter is cultured, or indoleacrylic acid (IAA) 
or the like can by added thereto when a microorganism transformed with an expression vector containing trp promoter 
is cultured. 

[0241] Examples of the medium used in culturing a transformant obtained using animal cells as the host cells include 
RPMI 1640 medium (The Journal of the American Medical Association, 759:519(1967)), Eagle's MEM medium(Se*- 
ence, 122: 501 (1952)), Dulbecco's modified MEM medium (Virology, 8, 396 (1959)), 199 Medium (Proceeding of the 
Society for the Biological Medicine, 73:1 (1 950)), the above-described media to which fetal calf serum has been added 
and the like. ' 

[0242] The culturing is carried out generally at a pH of 6 to 8 and a temperature of 30 to 40°C in the presence of 5% 
C0 2 for1 to 7 days. 

[0243] Also, if necessary, antibiotics, such as kanamycin, penicillin, and the like, can be added to the medium during 
the culturing. 

[0244] Examples of the medium used in culturing a transformant obtained using insect cells as the host cells include 
TNM-FH medium (manufactured by Pharmingen), Sf-900 II SFM (manufactured by Life Technologies), ExCell 400 and 
ExCell 405 (manufactured by JRH Biosciences), Grace's Insect Medium (Nature, 195: 788 (1962)), and the like. 
[0245] The culturing is carried out generally at a pH of 6 to 7 and a temperature of 25 to 30°C for 1 to 5 days. 
[0246] Additionally, antibiotics, such as gentamicin and the like, can be added to the medium during the culturing if 
necessary. 

[0247] A transformant obtained by using a plant cell as the host cell can be used as the cell or after differentiating 
to a plant cell or organ. Examples of the medium used in the culturing of the transformant include Murashlge and Skoog 
(MS) medium, White medium, media to which a plant hormone, such as auxin, cytokinine, or the like has been added 
and the like. 

[0248] The culturing is carried out generally at a pH of 5 to 9 and a temperature of 20 to 40°C for 3 to 60 days. 
[0249] Also, antibiotics, such as kanamycin, hygromycin and the like, can be added to the medium during the cul- 
turing, if necessary. 

[0250] As described above, the polypeptide can be produced by culturing a transformant derived from a microor- 
ganism, animal cell or plant cell containing a recombinant vector to which a DNA encoding the polypeptide of the 
present invention has been inserted according to the general culturing method to produce and accumulate the polypep- 
tide, and recovering the polypeptide from the culture. 

[0251] The process of gene expression may include secretion of the encoded protein production or fusion protein 
expression and the like in accordance with the methods described in Molecular Cloning, 2nd ed., in addition to direct 
expression. 

[0252] The method for producing the polypeptide of the present invention includes a method of intracellular expres- 
sion in a host cell, a method of extracellular secretion from a host cell, or a method of production on a host cell membrane 
outer envelope. The method can be selected by changing the host cell employed or the structure of the polypeptide 
produced. 

[0253] When the polypeptide of the present Invention is produced in a host cell or on a host cell membrane outer 
envelope, the polypeptide can be positively secreted extracellularty alxording to, for example, the method of Paulson 
etal. (J. Biol. Chem., 264: 17619 (1989)), the method of Lowe et al. (Proc. Natl. Acad. ScL USA, 86: 8227 (1989); 
Genes Develop., 4: 1288 (1 990)), and/or the methods described in Japanese Published Unexamined Patent Application 
No. 336963/93, WO 94/23021 , and the like. 

[0254] Specifically, the polypeptide of the present invention can be positively secreted extracellularty by expressing 
it In the form that a signal peptide has been added to the foreground of a polypeptide containing an active site of the 
polypeptide of the present Invention according to the recombinant DNA technique. 

[0255] Furthermore, the amount produced can be increased using a gene amplification system, such as by use of 
a dihydrofolate reductase gene or the like according to the method described in Japanese Published Unexamined 
Patent Application No. 227075/90. 

[0256] Moreover, the polypeptide of the present invention can be produced by a transgenic animal individual (trans- 
genic nonhuman animal) or plant individual (transgenic plant). 

[0257] When the transformant is the animal individual or plant individual, the polypeptide of the present invention 
can be produced by breeding or cultivating it so as to produce and accumulate the polypeptide, and recovering the 
polypeptide from the animal individual or plant individual. 

[0258] Examples of the method for producing the polypeptide of the present invention using the animal individual 
include a method for producing the polypeptide of the present invention in an animal developed by inserting a gene 
according to methods known to those of ordinary skill in the art (American Journal of Clinical Nutrition, 63: 639S (1 996), 
American Journal of Clinical Nutrition, 63. 627S (1 996), Bio/Technology, 9. 830 (1 991 )). 
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[0259] in the animal individual, the polypeptide can be produced by breeding a ""JJJJES^^ 

the DNA encoding the polypeptide of the present invention has been inserted to produce and accumulate the 

Sn^ 

in the animal inciude milk (Japanese Published Unexamined Patent ^* N ;^ 

the animal Anv promoter can be used, so long as it can be expressed in the animal. Suitable examples inc urn « 

^X^M^-I" P romoter - a Hactoglobulin promoter, a whey acidic protein promoter, and the like, which 

f026^~Z the ^peptide of the present invention «-"-•->« «*« 

nclude a method for producing the pofypeptide of the present Invention by cultivating a J-^^JJJ^ 
DNA encoding the protein of the present invention by a known method (T,ssue 

(1994), Trends in Biotechnology, 15: AS (1997)) to produce and accumulatethepolypept.de .ntheplant. and recovenng 

^1^* hS^^rtlng to the present invention can also be obtained by translation 
0262 The pojpeptide of the present invention can be produced by a translation system m a «\'° r 
example two to ^translation methods whichmay be used, namely, a memod using RN A as a template and a^ 
3d usTngSNA as a template. The template RNA includes the who.e RNA. mRNA, an in ^^Sm^S 
Tnd the like. The template DNA includes a plasmid containing a transcriptions 'P™<^^ • SSSSSS 
therein and downstream of the initiation site, a PCR/RT-PCR product and tte fcto J^2^ - 2S!l^ 
for the in vitro translation, the origin of the gene encoding the protein to be 

cell) the type of the template (DNA/RNA), the purpose of using the synthesized protein and the like should > be eonstf 
ered "n Sfo translation kits having various characteristics are commercially available from many companies (Boe- 
SefMtnS. Promega, Strata'gene, or the .ike), and every kit can be used in producing the polypeptide according 

lo^^sXSansiatton of a DNA nuc.eot.de sequence Coned Into • 

be earned out using an In vitro transcriptionAranslation system E eoKTJ S30 Extract System for Circular DNA ^(man- 
ufactured by Promega. catalogue No. L1130). Also. transcriptionAranslation using, as a template, a linear P"*W*° 
DN?o7a supeiS. non-sensiSve promote, such as /acUVS. tac, XPL(con), XPL. or the like, "*^£«™B 
an in vftrotranscriptionrtmnslation system £. «tf S30 Extract System for Linear Temp 

catalogue No L1030). Examples of the linear prokaryotic DNA used as a template include a DNA fragment^ a I PCR 
Sm pZa duplicated oiigonudeotide ligation, an in vitro transcriptional RNA. a prokaryotc RNA. and 



SST addition to the production of the polypeptide according to the Present invention ^^J^SSZ 
labeled protein, confirmation of the expression capability of a cloned gene, analysis of the function of transcnptional 
reaction or translation reaction, and the like can be carried out using this system. 

TO265] The polypeptide produced by the transformant of the present invention can be isolated and punted using the 
Lenera. method isolating and purifying an enzyme. For examp.e. when the polypeptide of the present inventon • 
expressed as a soluble product in the host cells, the cells are collected by centrif ugatlon after cultivation, suspended 

or the Hke to obtain a cell-free extract. From the supernatant obtained by centnfug.ng the c ^- ,r ^^J.^^ 
product can be obtained by the genera, method used for isolating and purifying an enzyme, for example solvent ex- 
traction, salting out using ammonium suifate or the like, desalting precipftation using an 

change chromatography using a resin, such as diethylamlnoethyl (DEAE -Sepharose H^7^a^8^8d 
by Mitsubishi Chemical) or the like, cation exchange chromatography using a resin, such as S-S<£utfoae J^™" 
f actured by Pharmacia) or the like, hydrophobic chromatography using a resin. such as buy 

rose or the like, gel filtration using a molecular sieve, affinity chromatography, chromatofocusing. or electrophoresis, 
such as isoelectronic focusing or the like, alone or in combination thereof ~ llaete d u> the 

[0266J When the polypeptide is expressed as an insoluble product in ^^^^^^SStM^ 
same manner. disrupted Tend centrifuged to recover the insoluble product of the P^^-^^^*^ 
Next, the insoluble product of the polypeptide is solubilized with a protein denaturing agent. The sol uUlized so rton 
I diluted or dlalyzeS to lower the concentration of the protein denaturing agent in the solution^ Thus the normal cor, 
figuration of the polypeptide is reconstituted. After the procedure, a purified product of the polypepbde can be obtained 
by a purification/isolation method similar to the above. , . n Hdina 

K3ST] When the polypeptide of the present invention or its derivative (for example, a polypeptide formed by adding 

asug^archaire^ 

Namely, the culture supernatant is obtained by treating the culture medium in a ^f^r^^SZZ 
example,centrifugation).Then,apurrfiedproduct can be obtained from the culture medium using a punfication/isolation 

method similar to the above. n „ eant invention 

[0268] The polypeptide obtained by the above method is within the scope of the polypept.de of the present invention, 
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and examples include a polypeptide encoded by a polynucleotide comprising the nucleotide sequence selected from 
SEQ ID NOS:2 to 3431, and a polypeptide comprising an amino acid sequence represented by any one of SE(J ID 
NOS:3502to6931. 

[0269] Furthermore, a polypeptide comprising an amino acid sequence in which at least one amino acids is deleted, 

s replaced, inserted or added in the amino acid sequence of the polypeptide and having substantially the same activity 
as that of the polypeptide is included in the scope of the present invention. The term "substantially the same activity 
as that of the polypeptide" means the same activity represented by the inherent function, enzyme activity or the like 
possessed by the polypeptide which has not been deleted, replaced, inserted or added. The polypeptide can be ob- 
tained using a method for introducing part-specific mutation(s) described in, for example, Molecular Cloning, 2nd ed., 

io Current Protocols in Molecular Biology, Nuc. Acids. Res., 10: 6487 (1982), Proc. Natl. Acad. Sci. USA, 79. 6409 (1982), 
Gene, 34: 315 (1985), Nuc. Acids. Res., 13: 4431 (1985), Proc. Natl. Acad. Sci. USA, 82: 488 (1985) and the like. For 
example, the polypeptide can be obtained by introducing mutation (s) to DNA encoding a polypeptide having the amino 
acid sequence represented by any one of SEQ ID NOS:3502 to 6931 . The number of the amino acids which are deleted, 
replaced, inserted or added is not particularly limited; however, it is usually 1 to the order of tens, preferably 1 to 20, 

* 5 more preferably 1 to 1 0, and most preferably 1 to 5, amino acids. 

[0270] The at least one amino acid deletion, replacement, insertion or addition in the amino acid sequence of the 
polypeptide of the present invention is used herein to refer to that at least one amino acid is deleted, replaced, inserted 
or added to at one or plural positions in the amino acid sequence. The deletion, replacement, insertion or addition may 
be caused in the same amino acid sequence simultaneously. Also, the amino acid residue replaced, inserted or added 

20 can be natural or non-natural. Examples of the natural amino acid residue include L-alanine, L-asparagine, L-asparatic 
acid, L-glutamine, L-glutamic acid, glycine, L-histidine, L-isoleuclne, L-leucine, L-lysine, L-methionlne, L-phenylalanine, 
L-proline, L-serine, L-threonlne, L-tryptophan, L-tyrosine, L-valine, L-cystelne, and the like. 

[0271] Herein, examples of amino acid residues which are replaced with each other are shown below. The amino 
acid residues In the same group can be replaced with each other. 

25 

Group A: 

[0272] leucine, isoleucine, norieucine, valine, norvaline, alanine, 2-aminobutanoic acid, methionine, O-methylserine, 
t-butylglycine, t-butylalanine, cyclohexylalanine; 

30 

Group B: 

[0273] asparatic acid, glutamic acid, isoasparatic acid, isoglutamic acid, 2-aminoadipic acid, 2-aminosuberic acid; 
35 Group C: 

j 

[0274] asparagine, glutamine; 

Group.D: ... 

40 

[0275] lysine, arginine, ornithine, 2,4-diaminobutanoic acid, 2,3-diaminopropionic acid; 
Group E: 

4* [0276] proline, 3-hydroxyprollne, 4-hydroxyproline; 
Group F: 

[0277] serine, threonine, homoserine; 

50 

Group G: 

[0278] phenylalanine, tyrosine. 

[0279] Also, in order that the resulting mutant polypeptide has substantially the same activity as that of the polypeptide 
55 which has not been mutated, it is preferred that the mutant polypeptide has a homology of 60% or more, preferably 
80% or more, and particularly preferably 95% or more, with the polypeptide which has not been mutated, when calcu- 
lated, for example, using default (initial setting) parameters by a homology searching software, such as BLAST, FASTA, 
or the like. 
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fO2801 Also the polypeptide of the present invention can be produced by a chemical synthesis method^ such as 

«^!nl O oiri and analoaues thereof can be produced by culturing the transformant containing the polynucleotide or 
^Sin^ ina medium to produce and accumulate at ,east one component selected 



ST "eSsynthesfe pathways, decomposer, pathways and regu.atory — * ^^^^ 
substances such as amino acids, nucleic acids, vitamins, saccharides, organic acids and analogues thereo ' ^JJ™ 
oSm to omanism. The productivity of such a physiologically active substance can be improved us.ng t hese drffer- 
encesspSSybyin^ 

7£?S» 8 one of the essential amino acids. In a plant seed was improved by introducing a synthase^ gene 

synthase gene derived from Escherichia coli (Japanese Examined Patent Publication 2375<V93) 

To produce such a physiologically active substance, the transformant accord, njr ^Z^nt 
be cultured by the same method as employed in culturing the transformant for producing the polypeptide of ttie present 
Son « Md above. Also, the physiologically active substance can be «««^2^^ B ^ 
n combination with for example, the ion exchange resin method, the precipitation method and other known methods. 
028^ S^^io* known to one of ordinary skill in the art include '^"f^^^S^ 
he protoplast method, the method using a phage, and the like, when the host tea '^SJ^.^TSS 

P^. 1 998)) Examples of the host include prokaryotes. lower eukaryotes (for example y«*te). 
^SS^ZZL*. mammals), and cells isolated therefrom. As the state j^^P^S 
fragmentpresentbi Ithe hostcells.it can be integrated into the chromosome of fte host. Mema^ 
nto a factor (for example a plasmid) having an independent replication unit outside the chromosome These trans 
S^^^T^S** the polypeptides of the present invention encoded by the ORF genome of 
Corvnebacferiu/n glutamicum. the polynucleotides of the present inventjon and fragments thereof. Alternatively, they 
caTbT^TroduTg arbirary polypeptides under the regulation by an EMF of the present invent™. 

11. Preparation of antibody recognizing the polypeptide of the present invention 

[0286] An antibody which recognizes the polypeptide of the present invention, such as a P^^SS^Zrt 
octonal antibody. orL like, can be produced using, as an antigen, a purified product of the polypeptide , o the present 
Stton or aSrtial fragment polypeptide of the polypeptide or a peptide having a partial amino acd sequence of the 
polypeptide of the present invention. 

(1) Production of polyclonal antibody 

102871 A polyclonal antibody can be produced using, as an antigen, a purified product of the polypeptide of the 
preset Ze^Tl^Z™* polypeptide of the polypeptide, or a peptide having a partial ammo acid sequence 

S3 Wn^nTep^^^ 

SSSVin, bovine thyrog.obu.in. or the like. The peptide used as the antigen can be synthesized 

f^rteS^ation of the antigen is. for example, carried out 3 to 10 times at the intervals ofl or 2 1 weeks 
StVr the fimt admSration On the 3rt to 7th day after each administration, a blood sample is collected from the 
venousplelo^ 

t&tzyrne-linked Immunosorbent Assay (EUSA), Igaku Shoin (1976) ; Antibodies - A Laboratory Manual, Cold Spnng 
Ste^ 

used for the immunization, and the serum is isolated and purified to obtain a polyclonal ant.body. 
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[0293] Examples of the method for the isolation and purification include centrifugatlon, salting out by 40-50% satu- 
rated ammonium sulfate, capryllc acid precipitation (Antibodies, A Laboratory manual, Cold Spring Harbor Laboratory 
(1988)), or chromatography using a DEAE-Sepharose column, an anion exchange column, a protein A- or G-column, 
a gel filtration column, and the like, alone or in combination thereof, by methods known to those of ordinary skill In the art! 

(2) Production of monoclonal antibody 

(a) Preparation of antibody-producing cell 

[0294] A rat having a serum showing an enough antibody titer against a partial fragment polypeptide of the polypep- 
tide of the present Invention used for immunization is used as a supply source of an antibody-producing cell. 
[0295] On the 3rd to 7th day after the antigen substance is finally administered the rat showing the antibody titer the 
spleen is excised. 

[0296] The spleen Is cut to pieces In MEM medium (manufactured by Nissul Pharmaceutical), loosened using a pair 
of forceps, followed by centrifugation at 1 ,200 rpm for 5 minutes, and the resulting supernatant is discarded. 
[0297] The spleen in the precipitated fraction is treated with a Tris-ammonium chloride buffer (pH 7.65) for 1 to 2 
minutes to eliminate erythrocytes and washed three times with MEM medium, and the resulting spleen cells are used 
as antibody-producing cells. 

(b) Preparation of myeloma cells 

[0298] As myeloma cells, an established cell line obtained from mouse or rat is used. Examples of useful cell lines 
include those derived from a mouse, such as P3-X63Ag8-U1 (hereinafter referred to as "P3-U1 ") (Curr. Topics in Micro- 
biol Immunol., 81: 1 (1978); Europ. J. Immunol., 6: 511 (1976)); SP2/0-Agl4 (SP-2) (Nature, 276. 269 (1978))- 
P3-X63-Ag8653 (653) (J. Immunol., 123: 1548 (1979)); P3-X63-Ag8 (X63) cell line (Nature, 256: 495 (1975)), and the 
like, which are 8-azaguanine-resistant mouse (BALB/c) myeloma cell lines. These cell lines are subcultured In 8-aza- 
guanine medium (medium in which, to a medium obtained by adding 1.5 mmol/l glutamine, 5X10- 5 molfl 2-mercap- 
toethanol, 10 u.g/ml gentamicin and 10% fetal calf serum (FCS) (manufactured by CSL) to RPMI-1640 medium (here- 
inafter referred to as the "normal medium"), 8-azaguanine is further added at 15 u.g/ml) and cultured in the normal 
medium 3 or 4 days before cell fusion, and 2x1 0 7 or more of the cells are used for the fusion. 

(c) Production of hybridoma 

[0299] The antibody-producing cells obtained in (a) and the myeloma cells obtained in (b) are washed with MEM 
medium or PBS (dlsodium hydrogen phosphate: 1 .B3 g, sodium dihydrogen phosphate: 0.21 g, sodium chloride: 7.65 
g, distilled water 1 liter, pH: 7.2) and mixed to give a ratio of antibody-producing ceils : myeloma cells = 5 : 1 to 10 : 
1 , followed by centrifugation at 1 ,200 rpm for 5 minutes, and the supernatant is discarded. 

[0300] The cells in the resulting precipitated fraction were thoroughly loosened, 0.2 to 1 ml of a mixed solution of 2 
g of polyethylene glycol-1000 (PEG-1000), 2 ml of JvlEM medium and 0.7 ml of dimethylsulf oxide (DMSO) per 10 8 
antibody-producingcells is "added to the cells" under stirring at 37°C, and then 1 to 2 ml of MEM medium is further 
added thereto several times at 1 to 2 minute intervals. 

[0301] After the addition, MEM medium is added to give a total amount of 50 ml. The resulting prepared solution Is 
centrifuged at 900 rpm for 5 minutes, and then the supernatant is discarded. The cells in the resulting precipitated 
fraction were gentry loosened and then gently suspended in 1 00 ml of HAT medium (the normal medium to which 1 0" 4 
mol/l hypoxanthine, 1.5x10-5 mol/l thymidine and 4x10' 7 mol/l aminopterin have been added) by repeated drawing 
up into and discharging from a measuring pipette. 

[0302] The suspension is poured into a 96 well culture plate at 1 00 uJ/well and cultured at 37°C for 7 to 14 days in 
a 5% CO a Incubator. 

[0303] After culturing, a part of the culture supernatant is recovered, and a hybridoma which specifically reacts with 
a partial fragment polypeptide of the polypeptide of the present invention is selected according to the enzyme immu- 
noassay described in Antibodies, A Laboratory manual, Cold Spring Harbor Laboratory, Chapter 14(1 998) and the like. 
[0304] A specific example of the enzyme immunoassay is described below. 

[0305] The partial fragment polypeptide of the polypeptide of the present invention used as the antigen in the immu- 
nization is spread on a suitable plate, is allowed to react with a hybridoma culturing supernatant or a purified antibody 
obtained in (d) described below as a first antibody, and is further allowed to react with an anti-rat or anti-mouse immu- 
noglobulin antibody labeled with an enzyme, a chemical luminous substance, a radioactive substance or the like as a 
second antibody for reaction suitable for the labeled substance. A hybridoma which specifically reacts with the polypep- 
tide of the present invention Is selected as a hybridoma capable of producing a monoclonal antibody of the present 
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producing a monoclonal antibody of the present invention, 
(d) Preparation of monoclonal antibody 

10307] The monoclonal antibody-producing hybridoma ceils obtainec ^J^jJJJ SJS^SKSS^ 

S Tjr^dtTonected from the mice or nude mice, and centred to remove solid contents at 3000 
[Si? ^totclona, antibody can be purified and isoiated from the resuiting supernatant according to the method 

ZSfZXXSZ* t »» — > «*» «- ^its^ -ESSES- 

Ccu-i Vol. 5. Immunobiochemistry Research Method, Tokyo Kagaku Dojin ( M J). 

(e.g.. horseradish peroxidase, alkaline phosphatase, or the l.ke), a «uo escence taJM«|. ^ ^ 
like), a label using a rhodamine atom, (J. Hfetochem. Cytocfcem tft 316 (1 970), Mem. tnzym., i 

using the antibody of the present invention. 
12. Production and use of polypeptide array 
(1) Production of polypeptide array 

,03,7]- A polyp**., n « be produced ue.ng ft. po.yp.p1id. rt the , pMtt *««*». obtfteed ft ft. Mft. 

polyacrylamide or the like; complex carbohydrates, such as agarose, sepharose, or the l.ke. silica, a shea based ma 

Enzym 34 (1974); Advances in Experimental Medicine and Biology, 42 (1 974); U.S. Patent 4.681 .870. U.S. latent 

e^ent-y performed by adhering the polypeptide or antibody of the 
5S2t in^enSonTthe solid support at a high density, though a high fixation density is not always necessary. 



f 
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(2) Use of polypeptide array 

[0322] A polypeptide or a compound capable of binding to and interacting with the polypeptides of the present In- 
vention adhered to the array can be identified using the polypeptide array to which the polypeptides of the present 
invention have been adhered thereto as described in the above (1 ). 

[0323] Specif icaliy, a polypeptide or a compound capable of binding to and interacting with the polypeptides of the 
present invention can be identified by subjecting the polypeptides of the present invention to the following steps (I) to (iv): 

(0 preparing a polypeptide array having the polypeptide of the present invention adhered thereto by the method 
of the above (1); 

(ii) incubating the polypeptide immobilized on the polypeptide array together with at least one of a second polypep- 
tide or compound; 

(iii) detecting any complex formed between the at least one of a second polypeptide or compound and the polypep- 
tide immobilized on the array using, for example, a label bound to the at least one of a second polypeptide or 
compound, or a secondary label which specifically binds to the complex or to a component of the complex after 
unbound material has been removed; and 

(iv) analyzing the detection data. 

[0324] Specific examples of the polypeptide array to which the polypeptide of the present invention has been adhered 
include a polypeptide array containing a solid support to which at least one of a polypeptide containing an amino acid 
sequence selected from SEQ ID NOS:3502 to 7001, a polypeptide containing an amino acid sequence in which at 
least one amino acids is deleted, replaced, inserted or added in the amino acid sequence of the polypeptide and having 
substantially the same activity as that of the polypeptide, a polypeptide containing an amino acid sequence having a 
homology of 60% or more with the amino acid sequences of the polypeptide and having substantially the same activity 
as that of the polypeptides, a partial fragment polypeptide, and a peptide comprising an amino acid sequence of a part 
of a polypeptide. 

[0325] The amount of production of a polypeptide derived from corynef orm bacteria can be analyzed using a polypep- 
tide array to which the antibody of the present invention has been adhered in the above (1). 

[0326] Specifically, the expression amount of a gene derived from a mutant of coryneform bacteria can be analyzed 
by subjecting the gene to the following steps (i) to (iv): 

(i) preparing a polypeptide array by the method of the above (1 ); 

(ii) incubating the polypeptide array (the first antibody) together with a polypeptide derived from a mutant of co- 
ryneform bacteria; 

(iii) detecting the polypeptide bound to the polypeptide immobilized on the array using a labeled second antibody 
of the present invention; and 

(iv) analyzing the detection data. 

[0327] Specific examples of the polypeptide array to which the antibody of the present invention is adhered include 
a polypeptide array comprising a solid support to which at least one of an antibody which recognizes a polypeptide 
comprising an amino acid sequence selected from SEQ ID NOS:3502 to 7001, a polypeptide comprising an amino 
acid sequence in which at least one amino acids is deleted, replaced, inserted or added in the amino acid sequence 
of the polypeptide and having substantially the same activity as that of the polypeptide, a polypeptide comprising an 
amino acid sequence having a homology of 60% or more with the amino acid sequences of the polypeptide and having 
substantially the same activity as that of the polypeptides, a partial fragment polypeptide, or a peptide comprising an 
amino acid sequence of a part of a polypeptide. 

[0328] A fluctuation in an expression amount of a specific polypeptide can be monitored using a polypeptide obtained 
In the time course of culture as the polypeptide derived from coryneform bacteria. The culturing conditions can be 
optimized by analyzing the fluctuation. 

[0329] When a polypeptide derived from a mutant of coryneform bacteria is used, a mutated polypeptide can be 
detected. 

13. Identification of useful mutation in mutant by proteome analysis 

[0330] Usually, the proteome is used herein to refer to a method wherein a polypeptide is separated by twodimen- 
sional electrophoresis and the separated polypeptide is digested with an enzyme, followed by identification of the 
polypeptide using a mass spectrometer (MS) and searching a data base. 

[0331] The two dimensional electrophoresis means an electrophoretic method which is performed by combining two 
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n the S» productivity can be identified. For exampie. as a resuit of the proteome analyse on a wUd type *wn and 
L lystne-producing st rain, the productivity of the catalase having the amino ac,d sequence represented by SEQ ID NO. 

5^1. ™S^T„miln M> the mod«leii protein cur be etrieierrrly ldennfleduslno.rhe recording mod.urrr 

is not limited thereto. 
Example 1 

Determination of the full nucleotide sequence of genome of Corynebacterium glutamicum 

103401 The full nucleotide sequence of the genome of Corynebacterium glutamicum was determined based I on the 
whZ genome shotgun method (Science. 26* 496-512 (1995)). In this method, a genome 

me ^tSnal sequences were deiermined at random. Subsequently, these sequences were ..gated on a computer to 
cover the full genome. Specifically, the following procedure was carried out. 

(1) Preparation of genome DNA of Corynebacterium glutamicum ATCC 13032 

.03411 Coiynebacferfi/m glutamicum ATCC 13032 was cultured in BY medium (7 g/l meat extract. 1 3 
5£dJ IS 5 g/l yeast extract. pH 7.2) containing 1 % of glycine at ^J^^^^^^T^ 
bv cZ rifuaaion AtteT washing with STE buffer (10.3% sucrose. 25 mmol/l Tris hydrochloride. 25 mmoUl E DTA , | pH 
e oTTe ^irwereTuspended in 10 ml of STE buffer containing 10 mg/m. lysozyme. followed by gently shaWng « 

3 £cTor^^ 
a r^CforTo^ 

followed bv aentlv shaking at room temperature for 30 minutes and centnfugation (15,000 x g, 20 minutes, 20 C^Tne 
aouTous^ 

KE!2^E£tS aqueous .aver, 3 moM sodium acetate solution (pH 5.2) and isopropanol were added at 
1/10 timTvSume and twice volume, respectively, foHowed by gently stimng to 

oenome DNA was dissolved again in 3 ml of TE buffer (1 0 mmol/l Tris hydrochlonde, 1 mmol/l EDTA. pH B.0) coining 
0 02 mg/m. of Sase and maintained at 37-C for 45 minutes. The extractions w.th P^j^^^ £ 
chloroform were carried out successively in the same manner as the above. The genome DNA was subjected to .so 
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propanol precipitation. The thus formed genome DNA precipitate was washed with 70% ethanol three times, followed 
by alr-drying f and dissolved in 1 .25 ml of TE buffer to give a genome DNA solution (concentration: 0.1 mg/ml). 

(2) Construction of a shotgun library 

[0342] TE buffer was added to 0.01 mg of the thus prepared genome DNA of Corynebacterium glutamicum ATCC 
13032 to give a total volume of 0.4 mi, and the mixture was treated with a sonicator (Yamato Powersonic Model 150) 
at an output of 20 continuously for 5 seconds to obtain fragments of 1 to 10 kb. The genome fragments were blunt- 
ended using a DNA blunting kit (manufactured by Takara Shuzo) and then fractionated by 6% poly aery lam ide gel 

10 electrophoresis. Genome fragments of 1 to 2 kb were cut out from the gel, and 0.3 ml MG elution buffer (0.5 mold 
ammonium acetate, 1 0 mmol/l magnesium acetate, 1 mmot/i EDTA, 0. 1 % SDS) was added thereto, followed by shaking 
at 37°C overnight to elute DNA. The DNA eluate was treated with phenol/chloroform, and then precipitated with ethanol 
to obtain a genome library insert. The total insert and 500 ng of pUC1 8 Smal/BAP (manufactured by Amersham Phar- 
macia Biotech) were iigated at 1 6°C for 40 hours. 

15 [0343] The ligation product was precipitated with ethanol and dissolved in 0.01 ml of TE buffer. The ligation solution 
(0.001 ml) was introduced into 0.04 ml of E. coll ELECTRO MAX DH1 0B (manufactured by Life Technologies) by the 
electroporation under conditions according to the manufacture's instructions. The mixture was spread on LB plate 
medium (LB medium (1 0 g/l bactotrypton, 5 g/l yeast extract, 1 0 g/I sodium chloride, pH 7.0) containing 1 .6% of agar) 
containing 0.1 mg/ml ampiciilin, 0.1 mg/ml X-gal and 1 mmol/l isopropyl-p-D-thiogalactopyranoside (IPTG) and cultured 

so at 37°C overnight. 

[0344] The transformant obtained from colonies formed on the plate medium was stationarily cultured in a 96-well 
titer plate having 0.05 ml of LB medium containing 0.1 mg/ml ampiciilin at 37°C overnight. Then, 0.05 ml of LB medium 
containing 20% glycerol was added thereto, followed by stirring to obtain a glycerol stock. 

25 (3) Construction of cosmid library 

[0345] About 0.1 mg of the genome DNA of Corynebacterium glutamicum ATCC 13032 was partially digested with 
SauSAI (manufactured by Takara Shuzo) and then ultracentrifuged (26,000 rpm, 18 hours, 20°C) under 10 to 40% 
sucrose density gradient obtained using 10% and 40% sucrose buffers (1 mol/l NaCI, 20 mmol/l Tris hydrochloride, 5 

30 mmol/l EDTA, 1 0% or 40% sucrose, pH 8.0). After the centrifugation, the solution thus separated was fractionated into 
tubes at 1 ml in each tube. After confirming the DNA fragment length of each fraction by agarose gel electrophoresis, 
a fraction containing a large amount of DNA fragment of about 40 kb was precipitated with ethanol. 
[0346] The DNA fragment was iigated to the BamH\ site of superCosI (manufactured by Stratagene) in accordance 
with the manufacture's instructions. The ligation product was incorporated into Escherichia coti XL-1-BlueMR strain 

35 (manufactured by Stratagene) using Gigapack HI Gold Packaging Extract (manufactured by Stratagene) in accordance 
with the manufacture's instructions. The Escherichia coii was spread on LB plate medium containing 0.1 mg/ml amp- 
iciilin and cultured therein at 37°C overnight to Isolate colonies. The resulting colonies were stationarily cultured at 
37°C overnight in a 96-well titer plate containing 0.05 mi of the LB medium containing 0.1 mg/ml ampiciilin in each 
well. LB medium containing 20% glycerol (0.05 ml) was added thereto, followed by stirring to obtain a glycerol stock. 

40 " 

(4) Determination of nucleotide sequence 
(4-1) Preparation of template 

45 [0347] The full nucleotide sequence of Corynebacterium glutamicum ATCC 1 3032 was determined mainly based on 
the whole genome shotgun method. The template used in the whole genome shotgun method was prepared by the 
PCR method using the library prepared in the above (2). 

[0348] Specifically, the clone derived from the whole genome shotgun library was inoculated using a replicator (man- 
ufactured by GENETIX) into each well of a 96-well plate containing the LB medium containing 0.1 mg/ml of ampiciilin 

so at 0.08 ml per each well and then stationarily cultured at 37*C overnight. 

[0349] Next, the cuituring solution was transported using a copy plate (manufactured by Tokken) into a 96-well re- 
action plate (manufactured by PE Biosystems) containing a PCR reaction solution (TaKaRa Ex Taq (manufactured by 
Takara Shuzo)) at 0.08 ml per each well. Then, PCR was earned out in accordance with the protocol by Makino etal. 
(DNA Research, 5: 1-9 (1998)) using GeneAmp PCR System 9700 (manufactured by PE Biosystems) to amplify the 

55 inserted fragment. 

[0350] The excessive primers and nucleotides were eliminated using a kit for purifying a PCR production (manufac- 
tured by Amersham Pharmacia Biotech) and the residue was used as the template in the sequencing reaction. 
[0351] Some nucleotide sequences were determined using a double-stranded DNA plasmid as a template. 
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[0352] The double-stranded DNA plasmid as the template was obtained by the following method, 
[0353] The clone derived from the whole genome shotgun library was Inoculated into a 24- or 96-well plate containing 
a 2X YT medium (1 6 g/l bactotrypton, 1 0 g/l yeast extract, 5 g/l sodium chloride, pH 7.0) containing 0.05 mg/ml ampicilhn 
at 1 .5 ml per each well and then cultured under shaking at 37°C overnight. 

[0354] The double-stranded DNA plasmid was prepared from the culturing solution using an automatic plasmid pre- 
paring machine, KURABO PI-50 (manufactured by Kurabo Industries) or a multiscreen (manufactured by Millipore) in 
accordance with the protocol provided by the manufacturer. 

[0355] To purify the double-stranded DNA plasmid using the multiscreen, Biomek 2000 (manufactured by Beckman 
Coulter) or the like was employed. 

[0356] The thus obtained double-stranded DNA plasmid was dissolved in water to give a concentration of about 0.1 
mg/ml and used as the template in sequencing. 

(4-2) Sequencing reaction 

[0357] To 6 uJ of a solution of ABi PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit (manufactured 
by PE Biosystems), an M13 regular direction primer (M13-21) or an M13 reverse direction primer (M13REV) (DAW 
Research. 5: 1 -9 (1 998) and the template prepared in the above (4-1 ) (the PCR product or the plasmid) were added 
to give 10 uJ of a sequencing reaction solution. The primers and the templates were used In an amount of 1 .6 pmol 

and an amount of 50 to 200 ng, respectively. ^ 

[0358] Dye terminator sequencing reaction of 45 cycles was carried out with GeneAmp PCR System 9700 (manu- 
factured by PE Biosystems) using the reaction solution. The cycle parameter was determined in accordance with the 
manufacturer's instruction accompanying ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit. The 
sample was purified using Multiscreen HV plate (manufactured by Millipore) according to the manufacture's instruc- 
tions. The thus purified reaction product was precipitated with ethanol, followed by drying, and then stored in the dark 
at -30° C 

[0359] The dry reaction product was analyzed by ABI PRISM 377 DNA Sequencer and ABI PRISM 3700 DNA An- 
alyzer (both manufactured by PE Biosystems) each in accordance with the manufacture's instructions. 
[0360] The data of about 50,000 sequences in total (i.e. , about 42,000 sequences obtained using 377 DNA Sequenc- 
er and about 8,000 reactions obtained by 3700 DNA Analyser) were transferred to a server (Alpha Server 41 00: man- 
ufactured by COMPAQ) and stored. The data of these about 50,000 sequences corresponded to 6 times as much as 
the genome size. 



(5) Assembly 

[0361] All operations were carried out on the basis of UNIX platform. The analytical data were output in Macintosh 
platform using X Window System. The base call was carried out using phred (The University of Washington). The 
vector sequence data was deleted using SPS Crossmatch (manufactured by Southwest Parallel Software). The as- 
sembly was carried out using SPS phrap (manufactured by Southwest Parallel Software; a high-speed version of phrap 
(The University of Washington)). The contig obtained by the assembly was analyzed using a graphical editor, consed 
(The University of Washington). A series of the operations from the base call to the assembly were carried out simul- 
taneously using a script phredPhrap attached to consed. 



(6) Determination of nucleotide sequence in gap part 

[0362] Each cosmid in the cosmid library constructed in the above (3) was prepared by a method similar to the 
preparation of the double-stranded DNA plasmid described in the above (4-1 ). The nucleotide sequence at the end of 
the inserted fragment of the cosmid was determined by using ABI PRISM BigDye Terminator Cycle Sequencing Ready 
Reaction Kit (manufactured by PE Biosystems) according to the manufacture's instructions. 

[0363] About 800 cosmid clones were sequenced at both ends to search a nucleotide sequence in the contig derived 
from the shotgun sequencing obtained in the above (5) coincident with the sequence. Thus, the linkage between re- 
spective cosmid clones and respective contigs were determined and mutual alignment was carried out. Furthermore, 
the results were compared with the physical map of Corynebacterium glutamicum ATCC 13032 {Mot. Gen. Genet, 
2S2. 255-265 (1 996) to carrying out mapping between the cosmids and the contigs. 

[0364] The sequence in the region which was not covered with the contigs was determined by the following method. 
[0365] Clones containing sequences positioned at the ends of contigs were selected. Among these clones, about 
1 ,000 clones wherein only one end of the inserted fragment had been determined were selected and the sequence at 
the opposite end of the inserted fragment was determined. A shotgun library clone or a cosmid clone containing the 
sequences at the respective ends of the inserted fragment in two contigs was identified, the full nucleotide sequence 



34 



EP1 108 790 A2 



of the inserted fragment of this clone was determined, and thus the nucleotide sequence of the gap part was determined. 
When no shotgun library clone or cosmid clone covering the gap part was available, primers complementary to the 
end sequences at the two contlgs were prepared and the DNA fragment in the gap part was amplified by PCR. Then, 
sequencing was performed by the primer walking method using the amplified DNA fragment as a template or by the 
5 shotgun method in which the sequence of a shotgun clone prepared from the amplified DNA fragment was determined. 
Thus, the nucleotide sequence of the domain was determined. 

[0366] In a region showing a low sequence precision, primers were synthesized using AUTOFINISH function and 
NAVIGATING function of consed (The University of Washington) and the sequence was determined by the primer 
walking method to improve the sequence precision. The thus determined full nucleotide sequence of the genome of 
10 Corynebacterium gfutamicum ATCC 1 3032 strain is shown in SEQ ID NO:1 . 

(7) Identification of ORF and presumption of its function 

[0367] ORFs in the nucleotide sequence represented by SEQ ID NO:1 were identified according to the following 
15 method. First, the ORF regions were determined using software for identifying ORF, i.e., Glimmer, GeneMark and 
GeneMark.hmm on UNIX platform according to the respective manual attached to the software. 
[0368] Based on the data thus obtained, ORFs in the nucleotide sequence represented by SEQ ID NO:1 were iden- 
tified. 

[0369] The putative function of an ORF was determined by searching the homology of the identified amino acid 
20 sequence of the ORF against an amino acid database consisting of protein-encoding domains derived from Swiss- 
Prot, PIR or Genpept database constituted by protein encoding domains derived from GenBank database, Frame 
Search (manufactured by Compugen), or by searching the homology of the identified amino acid sequence of the ORF 
against an amino acid database consisting of protein -encoding domains derived from Swiss-Prot, PIR or Genpept 
database constituted by protein encoding domains derived from GenBank database, BLAST. The nucleotide sequences 
25 of the thus determined ORFs are shown in SEQ ID NOS:2 to 3501 , and the amino acid sequences encoded by these 
ORFs are shown in SEQ ID NOS:3502 to 7001 . 

[0370] In some cases of the sequence listings in the present invention, nucleotide sequences, such as TTG, TGT, 
GGT, and the like, other than ATG, are read as an initiating codon encoding Met. 

[0371] Also, the preferred nucleotide sequences are SEQ ID NOS:2 to 355 and 357 to 3501 , and the preferred amino 

30 acid sequences are shown in SEQ ID NOS:3502 to 3855 and 3857 to 7001 

[0372] Table 1 shows the registration numbers in the above-described databases of sequences which were judged 
as having the highest homology with the nucleotide sequences of the ORFs as the results of the homology search in 
the amino acid sequences using the homology-searching software Frame Search (manufactured by Compugen), 
names of the genes of these sequences, the functions of the genes, and the matched length, identities and analogies 

35 compared with publicly known amino acid translation sequences. Moreover, the corresponding positions were con- 
firmed via the alignment of the nucleotide sequence of an arbitrary ORF with the nucleotide sequence of SEQ ID NO: 
1 . Also, the positions of nucleotide sequences other than the ORFs (for example, ribosomal RNA genes, transfer RNA 
genes, IS sequences, and the like) on the genome were determined. 

[0373] Fig. 1 shows the positions of typical genes of the Corynebacterium giutamicum ATCC 1 3032 on the genome. 
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Function 


hypothetical membrane protein 


2,5-diketo-D-gluconic acid reductase 


5 -nucleotidase precursor 


5'-nudeotidase family protein 


transposase 


organic hydroperoxide detoxication 
enzyme 


ATP-dependent DNA helicase 
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Table 1 (continued) 


Homologous gene 
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Corynebacterium sp. ATCC 
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Vibrio parahaemolyticus nutA 


Deinococcus radiodurans 
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ferric enterobactin transport syscem 
permease protein 
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hypothetical membrane protein 


serineAhreonine protein kinase 
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probable electron transfer protein 
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DNA polymerase III holoenzyme tau 
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hypothetical protein 


thioredoxin reductase 


PrpD protein for propionate 
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carboxy phosphoenolpyruvate 
mutase 
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regulatory protein 


hypothetical protein 


hypothetical protein 
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hypothetical protein 


hypothetical protein 






hypothetical protein 


regulatory protein 
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Table 1 (continued) 


Homologous gene 


Mycobacterium tuberculosis 
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hypothetical protein 


phosphoserine transaminase 


acetyl-coenzyme A carboxylase 
carboxy transferase subunit beta 


hypothetical protein 


sodium/proline symporter 




hypothetical protein 
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fatty-acid synthase 
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Table 1 (continued) 


Homologous gene 


Mycobacterium tuberculosis 
H37Rv RvOB83c 


Bacillus circulans ATCC 21783 


Escherichia coli K12 accD 
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repressor of the high-affinity (methyl) 
ammonium uptake system 


hypothetical protein 
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Table 1 (continued) 


Homologous gene 
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Function 




imidazoleglycerol-phosphate 
dehydratase 


histidinol-phosphate 
aminotransferase 


histidinol dehydrogenase 


serine-rich secreted protein 






histidine secretory acid phosphatase 


tet repressor protein 


glycogen debranching enzyme 


hypothetical protein 


oxidoreductase 


myo-inositol 2-dehydrogenase 


galactltol utilization operon repressor 


ferrichrome transport ATP-binding 
protein or ferrichrome ABC 
transporter 


hemin permease 


iron-binding protein 


iron-binding protein 


hypothetical protein 
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ONA polymerase III epsilon chain 




maltooligosyi trehalose synthase 


hypothetical protein 










alkanal monooxygenase alpha chain 


hypothetical protein 




maltooligosyitrehalose 
trehalohydrolase 


hypothetical protein 


threonine dehydratase 
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Table 1 (continued) 


Homologous gene 
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Streptomyces coelicolor A3(2) 
SC7H2.05 




Arthrobacter sp. Q36 treZ 


Bacillus subtitis 168 


Corynebacterium glutamicum 
ATCC 13032 ilvA 






Catharanthus roseus metE 


Streptomyces coelicolor A3(2) 
dnaE 


Escherichia coli K12 rarO 


Campylobacter Jejuni DZ72 his 


Archaeoglobus fulgidus AF238 


40 


db Match 


gp:SCI8_12 




pir:S65769 


j 

o 
o 

3 

o 

UJ 

< 

CL 
Ol 










_l 
O 
X 
O. 

1 

CL 
*A 


CN 

X 
r*. 
O 
CO 
a. 
cn 




o . 
r*- 
r- 
m 

CD 
CO 

l: 
"ex 


sp:YVYE_BACSU 


sp:TH01_CORGL 






pir:S57636 


< 

co 

CO 

o 
in 

CN 

t 
Cl 


—i 

o 
o 

LU 

o 1 
or 

CL 
tfl 


LU 
- > 

< 

I' 

X 

cL 

in 


CO 

in 
cn 

8 




u 


1143 


to 
o 

CO 


2433' 


1023 


cn 
cn 

CO 


CO 

cn 


cn 

CO 


1056 


1044 


CO 

CO 


CO 
CN 


1785 


in 
CO 


1308 


O 

in 


to 
in 


12031 


3582 


o 

s 


CO 

to 
•<r 


CO 

o> 


45 


Terminal 
(nt) 


2234070 


2234763 


2237284 


2238353 


2236694 


2239845 


2240058 


2239508 


2241724 


2241738 
j 2242129~ 


2244819 


2242393 


2244864 


2246692 


2246295 


2247006 


co 
in 

CO 
CO 
TT 
CN 
CN 


2252856 


cn 
m 
to 

CO 

m 

CN 
CN 


2254642 


50 


Initial 
(nt) 


2232928 


2234158 


2234852 


2237331 


2239092 


2240042 


1 2240246 


1 2240563 


2240681 


2242115 


2242359 


2243035 


2243043 


2246171 


2246386 


2246450 


2248208 


2251939 


2252017 


2253192 


2253725 | 




SEQ 
NO. 
(a a.) 


•5815 


5816 


5817 


5818 


5819 


5820 


5821 




5823 


5824 




5826 


5827 


5828 


5829 


5830 


5831 


5832 


5833 


*T 
CO 
CO 

in 


5835 [ 


55 


UJ Q z 


2315 


2316 


2317 


2318 


cn 

T— 

CO 
CM 


J 2320 | 


I2321J 


[2322 


2323 


2324 


2325 


2326 


2327 


2328 


2329 


j 2330 


I 2331 


2332 


2333 


2334 


m 

CO 

to 1 

cn ; 



157 



EP 1 108 790 A2 



5 
10 




Function 


short chain dehydrogenase or 
general stress protein 


diaminopimelate (DAP) 
decarboxylase 


cysteine synthase 




ribosomal large subunit 
pseudouridine synthase D 


lipoprotein signal peptidase 




oleandomycin resistance protein 




hypothetical protein 
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Table 1 (continued) 


Homologous gene 


Bacillus subtilis 168 ydaD 


Pseudomonas aeruginosa lysA 


X 

o 

«n 

Z3 
XZ 
CL 

o 

*3 

OJ 

8 

C 
OJ 

cn 

is 5 

</> 

< o 




Escherichia coli K12rluD 


Pseudomonas (luorescens NCIB 
10586 IspA 




Streptomyces antibioticus oleB 




Rhodococcus erythropolis orfl7 


Bacillus licheniformis 


Escherichia coli K12dinP 


Escherichia coli K12 ybiF 


Streptomyces coelicolor A3(2) 
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Function 


cytochrome c oxidase subunit III 




hypothetical membrane protein 


cytochrome c oxidase subunit II 


gluta mine-dependent 
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protein) 


hypothetical protein 


hypothetical membrane protein 
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Function 


xanthine permease 


2,5-diketoD-gluconic acid reductase 






50S ribosomal protein L27 


50S ribosomal protein L21 


ribonuclease E 








hypothetical protetn 


transposase (insertion sequence 
IS31831) 


hypothetical protein 


hypothetical protein 


nucleoside diphosphate kinase 




hypothetical protein 


hypothetical protein 


hypothetical protein 




Matched 
length 
(aa) 


CM 
CM 

"V 


CO 

r- 

CM 






CO 


*- 
o 


CO 
CO 
CO 








m 






CO 






CN 
CO 


OJ 


CO 
V" 


Similarity 
(%) 


77.3 


81.9 






92.6 


82.2 


56.6 








82.6 


100.0 


76.9 


67.8 


89.6 




<«; 
CO 


64.3 


CO 
co 
CO 


Identity 
(%) 


39.1 | 


61.2 






80.3 


56.4 


30.1 








61.0 


99.1 


51.3 


37.8 


70.9 




34.8 


36.6 


33.9 


Homologous gene 


Bacillus subtitls 166 pbuX 


Corynebacterium sp. ATCC 
31090 






Streptomyces griseus IF013189 
rpmA 


Streptomyces griseus IF013189 
obg 


Escherichia coli K12rne 








Streptomyces coelicolor A3(2) 
SCF76.08C 


Corynebacterium glutamicum 
ATCC 31831 


Streptomyces coelicolor A3(2) 
SCF76.08C 


Streptomyces coelicolor A3(2) 
SCF76.09 


Mycobacterium smegmatis ndk 




Deinococcus radiodurans R1 
DR1844 


Mycobacterium tuberculosis 
H37Rv Rv1B83c 


Mycobacterium tuberculosis 
H37Rv Rv2446c 


db Match 
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CO 
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co 
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or 
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<n 
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3 
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o 

CO 

CL 

cn 
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CO 


m 
cn 

CO 

o 

U. 

< 

Ck. 
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CO 
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o 
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UJ 
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ii 
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CO 
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o 
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CO 
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CO 


m 
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CO 
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Terminal 
(nt) 


2501669 


2501735 


2503355 


2504265 


2503984 


2504300 


2504831 


2507663 


2507710 


i 2508640 


2509530 | 


2509523 


2511423 


2511876 


2511949 


2512409 


2513144 


2513154 


2513692 




Initial 
(nt) 


2499783 


2502577 


2502735 


2503870 


2504247 


2504602 


2507098 


2507115 


2507138 


2508094 


2508922 


2510830 


2511046 


2511427 


i 

2512356 


2512768 


2512803 


2513618 


2514114 
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cc 
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a 

(C 
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6089 
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6091 


CN 
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o 

CO 


6093 
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! 


Sol 

j w z c 
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Function 


folyl-polygiutamate synthetase 






valyl-tRNA synthetase 


oligopeptide ABC transport system 
substrate-binding protein 


heat shock protein dnaK 


lysine decarboxylase 


malate dehydrogenase 


transcriptional regulator 


hypothetical protein 


vanillate demethylase (oxygenase) 


pentachlorophenol 4- 
monooxygenase reductase 


transport protein 


malonate transporter 


class-Ill heat-shock protein or ATP- 
dependent protease 


hypothetical protein 


succinyl CoA'3-oxoadipate CoA 
transferase beta subunit 


succinyl CoA'3-oxoadipate CoA 
transferase alpha subunit 


15 




Matched 
length 
(a-a) 


LO 

*r 






m 
o 


CM 

m 


CO 

0 

tn 


0 

r- 
*— 


cn 

CO 


0 

CM 


CO 

8 


tn 

CO 


CO 

CO 
CO 




CO 
00 
CM 


0 

CO 


CO 
CO 
CO 

I 


0 

CM 


LO 
CN 


20 




Similarity 
(%) 


79.6 






72.1 


58.5 


54.9 


71.2 


76.5 


56.5 


vn 


68.6 


59.2 


76.8 


58.4 


85.8 


73.0 


85.7 


84:5 






Identity 
(%) 


55.4 






to 

to* 


CM 

xr 

CM 


26.2 


42.9 


to 
tn 


24.6 


0 

to 

CM 


39.5 


32.8 


CO 

0 


28.0 


59.8 


45.6 


63.3 


60.2 


25 
30 


Table 1 (continued) 


Homologous gene 


Streptomyces coelicolor A3(2) 
folC 






Bacillus subtilis 168 balS 


Bacillus subtilis 168 oppA 


Bacillus subtilis 168 dnaK 


Eikenella corrodens ATCC 
23824 


Thermus aquaticus ATCC 33923 
mdh 


Streptomyces coelicolor A3(2) 
SC4A10.33 


Vibrio cholerae aphA 


Acinetobacter sp. vanA 


Sphlngomonas flava ATCC 
39723 pcpD 


Acinetobacter sp. vanK 


Klebsiella pneumoniae mdcF 


Bacillus subtilis dpX 


Streptomyces coelicolor A3(2) 
SCF55.28C 


Streptomyces sp. 2065 pcaJ 


Streptomyces sp. 2065 peal 
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40 
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2514114 


2516273 
2516956 


2517751 


2515637 


2518398 


2521660 


2521667 
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2524337 


2524340 


2526226 


2527207 


2528559 


| 2528551 


2529484 


2531976 


2531969 


2532604 


50 
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2515487 


cm co 
co 

to CN 
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to i m 

CM J CN 


2517089 


2518336 


CM 

cn 
cn 

tO 

CM 


2520209 


2522251 


2523248 


2523561 


2524915 


2525099 


2526233 


2527135 


2529480 


2530761 


2530891 


2532601 


2533353 
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Function 


protocatechuate catabolic protein 


beta-ketothioiase 




3-oxoadipate enoMactone hydrolase 
and 4-carboxymuconolactone 
decarboxylase 


transcriptional regulator 


3-oxoadipate enol-lactone hydrolase 
and 4-carboxymuconolactone 
decarboxylase 




CO 
«3 
C 

o 

u 

D 

E 

in 

* {S 

?i 

CO o 


protocatechuate dioxygenase alpha 
subunit 


protocatechuate dioxygenase beta 
subunit 


hypothetical protein 


muconotactone isomerase 




muconate cycloisomerase 




catechol 1,2-dioxygenase 


i 
i 

j 


toluate 1,2 dioxygenase subunit 


Matched 
length 
(aa) 


m 

CN 
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CO 
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CO 
CN 




CO 


Similarity 
(%) 


82.5 


71.9 




76.6 


43.0 


89.6 




63.4 


to 

s 


91.2 


48.7 


81.5 




84.7 




68.4 




65.6 


Identity 


58.2 


CO 




50.8 


23.6 


78.3 




39.8 


49.5 


74.7 


26.4 


54.4 




60.8 




72.3 




02.2 


Homologous gene 


Rhodococcus opacus 1CP pcaR 


Ralstonia eutropha bktB 




Rhodococcus opacus pcaL 


Streptomyces coelicolor A3(2) 
SCM1.10 


Rhodococcus opacus pcaL 




Rhodococcus opacus pcaB 


Rhodococcus opacus pcaG 


Rhodococcus opacus pcaH 


Mycobacterium tuberculosis 
H37Rv Rv0336 


Mycobacterium tuberculosis 
catC 




Rhodococcus opacus 1CP catB 




Rhodococcus rhodochrous catA 




Pseudomonas putida plasmid 
pDK1 xylX 
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2534182 


2535424 
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2538256 


2538248 
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2538616 


2539709 


2540335 


2541187 


2542512 


2543813 


2542818 


2544867 


2544022 


2544928 


2546784 


Initial 
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2533391 


2534201 i 


2535168 
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6129 
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CO 
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to 
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Function 


toluate 1,2 dioxygenase subunit 


toluate 1,2 dioxygenase subunit 


1 ,2-dihydroxycyclonexa- J.a-aiene 
carboxylate dehydrogenase 


t 

L 

I 

c 
a 

>» 

i 

s 

o * 

I» 
*S c 

cn c 

2 3 


transmembrane tran6pon proiein or 
4-hydroxybenzoate transporter 


benzoate membrane transport 
protein 


ATP-dependent Clp protease 
proteolytic subunit 2 


ATP-dependent Clp protease 
proteolytic subunit 1 


hypothetical protein 


trigger factor (prolyl isomerase) 
(chaperone protein) 


hypothetical protein 


penicillin-binding protein 


hypothetical protein ! 




transposase 




hypothetical protein 


transposase J 


Matched 
length 
(aa) 
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CO 
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tn 

CO 
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Similarity 
(%) 


83.2 


81.0 


61.4 


46.6 


64.4 


66.2 


88.3 


85.9 


71.4 


66.4 


63.1 


609 


S 58.3 




73.2 




62.9 


78.7 


Identity 
(%) 


60.3 


51.5 


30.7 


23.3 


31.3 


29.9 


69.5 


62.1 


42.9 


32.1 


32.5 


25.3 


27.8 




54.2 




57.1 


50.7 


Homologous gene 


Pseudomonas putida plasmid 
pDK1 xylY 


Pseudomonas putida plasmid 
pDK1 xylZ 


Pseudomonas putida plasmid 
pDK1 xylL 


Rhodococcus erythropolis thcG 


Acinetobacter calcoaceticus 
pcaK 


Acinetobacter calcoaceticus 
benE 


m 

S 

w 

o 
a 

JU 

o 
a 

(A 
CJ 
U 
>s 

E 
o 

OJ Ou 
«£; Cv 

to 75 


Streptomyces coelicolor M145 
clpP1 


Sulfolobus isiandicus ORF154 


Bacillus subtilis 168 tig 


Streptomyces coelicolor A3(2) 
SCD25.17 


Nocardia lactamdurans LC411 
pbp 


Mus muscuius Moa1 




Corynebacterium striatum ORF1 




Corynebacterium striatum ORF1 


Corynebacterium striatum ORF1 
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8 
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Ql 
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CO 
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1536 
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CO 
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*r 
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to 
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to 


o 
vn 


CO 
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m 

CO 


!co 
: m 
: *r 


oo 

CM 
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to 

CM 


to 

CN 
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CO 

to 

CO 

co 

5 

CN 


2549695 


2552455 
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2555317 


2555978 


2556748 
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2561363 


2561483 
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58 

CN 
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2562078 


j 
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2546827 


2547333 


2548868 


2549771 
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2554026 
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2556580 


2556599 


2558106 


2558609 


2559157 


2560131 


2561115 


2561920 


2562093 


2562115 


2562341 


.gg: 
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T— 

to 
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R149 


6150 


1 
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c 














rotein 
















I 


p 




















Function 


acetylornilhine aminotransferase 


hypothetical protein 


hypothetical membrane protein 


acetoacetyl CoA reductase 
transcriptional regulator, TetR U 


polypeptides predicted to be us< 
antigens for vaccines and 
diagnostics 


ABC transporter ATP-binding pr 


globin 


chromate transport protein 


hypothetical protein 


hypothetical protein 




hypothetical protein 


ABC transporter ATP-binding pi 


hypothetical protein 


hypothetical membrane protein 


alkaline phosphatase 


Matched 
length 
(aa) 




CN 
CO 


CO 
CN 


LO C 

co 

CN CN 


s 


CO 
CO 
CN 




CO 


CD 
Ol 


CN 




in 
in 


CO 
CD 

m 


Ol 


o 

O 

r*. 


to 

CO 
LO 


milarity 
(%) 
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2703194' 


2704314 


m 

CO 
CO 
T 
O 
r«- 
. CM 


2709878 


2710637 






SEQ 
NO 
(aa.) 


6290 


6291 


6292I 


6293 


6294 


6295 


;8 
iS 


6297| 


6296 


6299 


6300 


6301 


6302 


6303 


6304 


6305 


6306 


6307 


6308 


55 




S §I 

CO 2 Q 


2790 


2791 


I2792 


• CO 
CO 

: r- 
jCM 


cn 

CN 


2795 


2796 


2797 


2798 


2799 


o 
o 

CO 
CN 


2801 


2802 


2803 


2804 


2805 


2806 


2807 


2806 
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Function 


methyltransferase 
hypothetical protein 


hypothetical protein 


i 

<D 
C 

c « 

£ w 

(0 CO 

s & 
Is 

O) JO 
CL -O 


hypothetical protein 


transcriptional regulator 


cvstetne svnthase 


O-acetytserine synthase 


hypothetical protein 


succinyl-CoA synthetase aipna 
chain 


hypothetical protein , 


succinyl-CoA synthetase beta chain 




frenolicin gene E product 


I 


succinyl-CoA coenzyme A 
transferase 


transcriptional regulator 


15 


Matched 
length 
(aa) 




CM 




o 

cn 


00 

CM 


in 
c 
r 


CM 
I s - 


cn 

00 


cn 

CM 


m 
r^. 


o 

o 




cn 

CM 




o : 
! 


8 


20 


Similarity 
(%) 


51.2 


75.0 


75.3 


CM 
CO 


69.0 


tc 

at 


79.7 


65.1 


79.4 


43.0 


73.0 




71.8 




77.8 


68.5 




Identity 


25.9 


71.0 


eo 


66.3 


45.9 


r* 
u 


61.1 ' 


36.1 


cn 

CM 

m 


42.0 


CO 

cn 
cn 




38.5 




47.9 


38.6 


6 Si 8 SI 
Table 1 (continued) 


Homologous gene 


Mycobacterium tuberculosis 
H37Rv Rv0089 


cn 

OI 

» ^ 
1 E 
5 2 

Z CO 

= 1 

10 CO 

E 

co ca o 
ez j= O 


Acinetobacter calcoaceticus 
NCIB 8250 murA 


Mycobacterium tuberculosis 
H37RvRv1314c 


Streptomyces coelicolor A3{2) 
SC2G5.15c 


I 

" 
c 

CJ 

! 
( 


CM 
LU 

- tA 

£ >» 
n cj 
— 
^ -6 

0 c 

3 CO 
- 0J 
*» .E 
= > 

1 * 

to u 
tn JO 

i € 

o o 

S 3 


Deinococcus radiodurans R1 
DR1844 


Coxiella burnetii Nine Mile Ph I 
sucD 


Aeropyrum pernix K1 APE1069 


Bacillus subtilis 168 sucC 




Streptomyces roseofulvus frnE 




Clostridium kluyveri call cati 


Azospirillum brasilense ATCC 
29145 ntrC 
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3 
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tA 
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cn 

O CL 1 
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5 

CL 

(A 


3 

5 
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CL 
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S 
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O 

o 
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O) 


z> 
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cn 

CO 
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o 

CL 
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-j 
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°. 
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!■ 
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m 

CM 

LO 
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o 
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cn 

tT 
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CO 

o 
-or 
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cm 
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CO 

CM 
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m 
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CM 


1194 


o 


in 
cn 
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cn 

CO 


1539 
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45 


Terminal 
(nt) 


2712374 


2713453 
2713842 


2717993 
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2720385 


2721295 


2722857 
2723609 
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2724478 


2725843 


2725384 


2726786 


cn 
cn 
cn 
r*- 

CM 
« f*- 
. CM 


2728207 


2729378 


2732518 


50 


Initial 
(nt) 


i — — 
o 

IT) 
CO 

CM 


2713181 
2713702 


2718187 
2719689 


2719750 


2721227 


2721702 


2721934 
5723064 


2724057 


2725359 


cn 

to 
m 

CN 

f- 

CN 


2726577 


in 

CN 

r- 


272B133 


2729025 


2730916 


2731376 




2§; 


6309 


6310 
6311 


6312 
6313 


6314 
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6316 
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6319 
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T- 

o 
<r 

C£ 
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m 

CN 
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to 


6326 
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8 

30 



35 



40 



45 



50 



55 



Fundion 




phosphate transport system 
regulatory protein 


phosphate-specific transport 
component 


phosphate ABC transport system 
permease protein 


phosphate ABC transport system 
permease protein 


phosphate-binding protein S-3 
precursor 


acetyttransferase 




hypothetical protein 


< hypothetical protein 


branched-chain amino acid 
aminotransferase 


hypothetical protein 


hypothetical protein 


S-phosphoribosyl-S-aminoimidazole 
synthetase 


amidophosphoribosyl transferase 


Matched 
length 
(aa) 




CO 
CN 


UO 

to 

CN 


CN 
O) 
CN 


vo 

CN 

CO 


CO 
CO 
CO 


to 

CO 




CO 


to 

CN 
CN 


CO 

to 

CN 


CM 

to 

CO 


co 

to 


CO 


CM 
CO 


Similarity 
(%) 


































81.7 


62.8 


82.2 


78.5 


56.0 


60.0 




CN 

to 
to 


74.2 


56.0 


79.0 


81.0 


94.2 


89.0 


Identity 
(%) 




46.5 


58.8 


51.4 


50.2 


40.0 


34.3 




I 24.7 


44.9 


28.6 


■ 

58.5 


CO 

CO 

LO 


81.0 


70.3 


Homologous gene 




Mycobacterium tuberculosis 
H37RvRv0821cphoY-2 


Pseudomonas aeruginosa pstB 


Mycobacterium tuberculosis 
H37Rv Rv0830 pstA1 j 


Mycobacterium tuberculosis 
H37Rv Rv0829 pstC2 


Mycobacterium tuberculosis 
H37Rv phoS2 


Streptomyces coelicolor A3(2) 
SCD84.18C 




Bacillus subtilis 168 bmrU 


Mycobacterium tuberculosis 
H37RvRv0813c 


Solan urn tuberosum BCAT2 


Corynebacterium 
ammoniagenes ATCC 6872 
ORF4 


Mycobacterium tuberculosis 
H37Rv Rv0810c 


Corynebacterium 
ammoniagenes ATCC 6872 
purM 


Corynebacterium 
ammoniagenes ATCC 6872 
purF 
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UJ 
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to 
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o 
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cn 
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to 
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CL 

cn 


to 

CO 
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CL 

cn 
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CO 
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f- 


cn 

CO 
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cn 
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fS 

CO 


CO 
CO 

r- 
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00 

CO 


CM 
CO 
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CO 
CM 


1074 
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(nt) 


2731424 


2733367 


2733455 


2734264 


2735202 


2736414 


2737836 


2739553 


2739556 


2741356 


2741636 


2743785 


2744222 


2744881 


2746083 
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(nt) 


2732230 


2732636 


1 

2734351 


2735184 


2736215 


2737538 


2738711 


2738771 


2740650 
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2742577 
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2744010 


2745954 


2747564 
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6332 
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Function 


S'-phosphoribosyM-N- 
succinocarboxamide-S-amino 
imidazole synthetase 


adenylosuccino lyase 


aspartate aminotransferase 


5-phosphoribosylglycinamide 
synthetase 

histidine triad (HIT) family protein 


hypothetical protein 
dU/tripeptide transpoter 


adenosylmethionme-B-amimw- 
oxononanoate aminotransferase or 
7,8-diaminopelargonic acid 
aminotransferase 


dethiobiotin synthetase 


two-component system sensor 
histidine kinase 


two-component system regulatory 
protein 


| transcriptional activator 


metal-adivated pyridoxal enzyme or 
low specificity D-Thr aldolase 


15 


Matched 
length 
(aa) 


3 

CM 


r- 
r- 


m 

CD 
CO 


tn co 

CM CO 
tt r- 


CO o> 
•^r co 

CM <Cf 


ro 

CM 


3 


tn 

a 


CO 
CM 


s 


2 

CO 


20 


Similarity 
(%) 


89.1 


95.0 


62.3 


Tj- CN 

CO 0 
CO 00 


56.4 
67.6 


98.8 


99.6 


70.5 


72.7 


69.5 


o> 

CO 

tn 




identity 
(%) 


70.1 


85.3 


28.1 


CO 

r- tn j 


26.8 

JU.l 


h» 

in 
o> 


98.7 


CO 
CO 


42.0 


37.4 


30.9 


Table 1 (continued) 


Homologous gene 


Corynebacterium 
ammoniagenes ATCC 6872 
purC 


Corynebacterium 
ammoniagenes ATCC 6872 
purB 


Sulfolobus soifataricus ATCC 
49255 


Corynebacterium 
ammoniagenes ATCC 6872 
purO 

Mycobacterium leprae u296a 


Methanosardna barken orf3 
Lactococcus lactis subsp. lactis 
dipT 


Corynebacterium glutamicum 
(Brevibaderium flavum) MJ233 
bioA 


Corynebacterium glutamicum 
(Brevibaderium flavum) MJ233 
bioD 


Lactococcus ladis M71plasmid 
PND306 


Thermotoga maritima drrA 


Streptomyces IMdans tipA 


Arthrobacter sp. DK-38 


.» 

40 
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1356 
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CN 
CO 
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tn 
0 
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CO 

m 
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45 


Terminal 
(nt) 


2761829 
2761785 


2763504 


2764978 


2766158 
2767993 


2767703 
2768343 

2769156 


2771982 


2772660 


2772644 


2774110 


2774937 


2775740 
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Initial 
(nt) 


2762452 
2762675 


n 
o> 
^* 

CD 

r«- 

CM 


2766135 


2767420 
2767580 


2768137 
2769095 

2770511 


2770714 


2771989 


2774098 


2774814 


2775689 


2776879 
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6358 
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,6363 


6364 
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6367 
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6369 
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6371 
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NO. 

(DNA) 
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2861 
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2865 
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i 

1 r- 
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1 CN 


2868 


2869 
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CO 

,CN 
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Function 


pyruvate oxidase 


multidrug efflux protein 


transcriptional regulator _! 


hypothetical membrane protein | 




3-ketosteroid dehydrogenase 


transcriptional regulator, LysR family 


hypothetical protein 


hypothetical protein { 




hypothetical protein 


hypothetical membrane protein 


transcription initiation factor sigma 


trehalose-6-phosphate synthase 




trehalose-phosphatase 


glucose-resistance amylase 
regulator 


e 

V 

To 

>s 

tfl 

0> 
CO 
Q. 
3 
U 

c 

N 

>» 
C 

Is 

£ Cl 
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length 
(aa) 
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Similarity 
(%) 


75.8 


68.9 


68.5 


CO 
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62.1 


69.0 


52.9 


55.6 




50.7 


64.0 


50.3 


66 7 




57.6 


60.2 


467 






Identity 
(%) 


45.3 


33.3 


30.4 


CO 
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34.3 


37.1 


28.4 


26.7 




28.6 


36.0 


32.3 | 


38.8 




27.4 


24.7 


22.4 


25 
30 

35 


Table 1 (continued) 


Homologous gene 


Escherichia coli K12 poxB 


Staphylococcus aureus plasmid 
pSK23 qacB 


Escherichia coli K12 ycdC 


Mycobacterium tuberculosis 
H37RvRv2508c 




Rhodococcus erythropolis SQ1 
kstDI 


Bacillus subtilis 168 alsR 


Mycobacterium tuberculosis 
H37Rv Rv3298clpqC 


Bacillus subtilis 168 ykrA 




Oryctolagus cunicutus kidney 
cortex rBAT 


Mycobacterium tuberculosis 
H37Rv Rv3737 


Streptomyces griseus hrdB 


Schizosaccharomyces pombe 
tpsi 




Escherichia coli K12 otsB 


Bacillus megaterium ccpA 


Haemophilus influenzae Rd 
HIQ119znuA 


40 
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2776768 
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2792448 


2792857 
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Initial 
(nt) 


2778504 


2778965 


2780439 


2780996 


2784481 


2785615 
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m 
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Function 


ABC transporter 


hypothetical membrane protein 
transposase (ISA0963-5) 


3-ketosteroid dehydrogenase 


lipopolysaccharide biosynthesis 
protein or oxidoreductase or 
dehydrogenase 


dehydrogenase or myo-mosttol 2- 
dehydrogenase 


shikimate transport protein 


shikimate transport protein 


transcriptional regulator 


ribosomal RNA ribose methylase or 
tRNA/rRNA methyltransferase 


cystelnyMRNA synthetase 


PTS system, en2yme II sucrose 
protein {sucrose-specific IIABC 
component) 


sucrose 6-phosphate hydrolase or 
sucrase 


glucosamine-6-phosphate 
isomerase 
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CM 
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CO 
CO 
CO 


CM 


Homologous gene 


Staphylococcus aureus 8325-4 
mreA 


Mycobacterium tuberculosis 
H37Rv Rv2060 
Archaeoglobus fulgidus 


Rhodococcus erythropolis SQ1 
kstDI 


Thermotoga maritima MSB8 
bplA 

i 


Bacillus subtilis 166 idh or iolG 


Escherichia coli K12shiA 


Escherichia coli Kl2shlA 


Streptomyces coelicolor A3(2) 
SC5A7.19C 


Saccharomyces cerevlsiae 
YOR201C PET56 


Escherichia co!IK12cysS 


Lactococcus ladis sacB 


Clostridium acetobutylicum 
ATCC 824 scrB 


Escherichia coli K12 nagB 


Vibrio furnissii SR1514 manD 
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(nl) 
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2801034 


2801313 
2801558 
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2804074 


2804676 


2805113 


2806016 


2806599 


2807426 


280B399 


2809824 


2811960 


2813279 


2814081 


Initial 
(nt) 


2797820 


2798837 


2801113 
2803246 


2803996 
2804691 


2805110 


2805967 


2806441 


2807252 


2808364 


2809778 


2811606 


2813258 


2814037 


2815232 
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CO Z < 


6391 
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6395 
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6397 
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cn 

CD 
f 
IC 


o 
o 

to 
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Function 


dihydrodipicolinate synthase 


glucokinase 


N-acetylmannosamine-6-phosphate 
epimerase 




sialidase precursor 


L-asparagine permease operon 
repressor 


dipeptide transporter protein or 
heme-binding protein 


dipeptide transport system 
permease protein 


oligopeptide transport ATP-binding 
protein 


oligopeptide transport ATP-binding 
protein 


homoserine/homoserin lactone 
efflux protein or lysE type 
transiocator 


leucine-responsive regulatory 
protein 




hypothetical protein 


hypothetical protein 


transcription factor 
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length 
(aa) 
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CO 
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CN 
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28.7 


36.4 




24.8 


26.6 


22.5 


31.9 


46.5 


CO 


28.5 


31.0 




55.9 




73.3 


Homologous gene 


Escherichia coli K12 dapA 


Streptomyces coelicolor A3{2) 
SC6E10.20c glk 


Clostridium perfringens NCTC 
8798 nanE 




Micromonospora vihdifaciens 
ATCC 31146 nadA ! 


Rhizobium etli ansR 


Bacillus firmus OF4 dppA 


Bacillus firmus OF4 dappB 


Bacillus subtilis 168 oppD 


Lactococcus ladis oppF 


Escherichia coli K12 rhtB 


Bradyrhizobium japonicum Irp 




Mycobacterium tuberculosis 
H37Rv Rv3581c 


Mycobacterium tuberculosis 
H37Rv Rv3582c 
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H37Rv Rv3583c 
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CO 
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2827458 


2827904 


2828379 
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2815458 
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2817363 


2816313 


2819564 


2820285 


2820584 


2822387 


2824274 
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2829146 


2829749 
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Function 


virulence factor 


CIpC adenosine triphosphatase / 
ATP-blnding proteinase 


inosine monophosphate 
dehydrogenase 


transcription factor 


phenol 2-monooxygenase 










lincomycin resistance protein 


hypothetical protein 


lysyMRNA synthetase 


CD 
CA 

ra 

CA 
CD 

c 

*c 

JO 
CO 

A 

J 

"55 
o 
c 

CD 
CL 


i 




hypothetical membrane protein 


eg 
c 

.5 

"H. 
CD S 

2 *6 c 

>»-C O 

^ "55 -S 
T E 8" 

o >> O 
c X .C 

fc ±r O 
CM £ Q. 


dihydroneopterin aldolase 


dihydropteroate synthase ] 


15 




Matched 
length 
(aa) j 


in 

in 


CM 
CO 
CO 


o> 

CO 


CO 
CO 


o 

CO 
CO 










CO 
T 


o 
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Similarity 
(%) 
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CM 

CD 
CO 


70.2 


r^- 

CN 
CO 
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o 

CO 










100.0 


55.8 


71.2 


52.6 






69.6 i 


69.0 


i 69.5 


75.0 






Identity 
(%) 


74.0 


58.5 


37 1 


CM 


33.5 










100.0 


26.7 




29.9 






29.0 


42.4 


38.1 


51.5 


25 
30 
35 


Table 1 (continued) 


Homologous gene 


Pseudomonas aeruginosa 
ORF25110 


Bacillus subtilis 168 mecB 


Bacillus cereus ts-4 impdh 


Rhodococcus rhodochrous nitR 


Trichosporon cutaneum ATCC 
46490 










Corynebacterium glutamicum 
ImrB 


Mycobacterium tuberculosis 
H37RvRv3517 


Bacillus stearothermophilus IysS 


Corynebacterium glutamicum 
ATCC 13032 panC 






Mycobacterium leprae 
MLCB2548.04C 


Methylobaderium extorquens 
AM1 folK 


Bacillus subtilis 168 folB 


Mycobacterium leprae folP 


40 
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2859205 
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2862132 
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2864384 
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! 2865346 


2865731 
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Function 


peptide synthase 




phenylacetaldehyde dehydrogenase 


hypothetical protein 


hypothetical protein 


hypothetical protein 


heat shock protein or chaperon or 
groEL protein 










i 




hypothetical protein 






peptidase 






Na«7H+ antiporter or multiple 
resistance and pH regulation related 
protein A or NADH dehydrogenase 


15 


Matched 
length 
(aa) 


1241 




CO 
CO 


CNI 


"I 
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CO 


CO 
T 
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Identity 
(%) 


28.4 




35.0 


57.3 


62.0 


74.0 


99.5 














21.7 






37.1 






35.6 


Homologous gene 


Streptomyces roseosporus cpsB 




Escherichia coli K12 padA 


Campylobacter jejuni Cj0604 


Mycobacterium tuberculosis 


Mycobacterium tuberculosis 


Brevibacterium flavum MJ-233 














Homo sapiens MUC5B 






Mycobacterium tuberculosis 
H37Rv Rv2522c 






Staphylococcus aureus mnhA 
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Function 




membrane transport protein or 
bicyclomycin resistance protein 


sodium dependent phosphate pump 


phenazine biosynthesis protein 




ABC transporter 


ABC transporter ATP-binding protein 


mutator mutT protein 


hypothetical membrane protein 


glutam ne-binding protein precursor 


serine/threonine kinase 




ferredoxin/ferredoxin-NADP 
reductase 


acetyltransferase (GNAT) family 


t 
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CO 
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cn </} 
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O CO 
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(A ^ 
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Matched 
length 
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Similarity 
(%) 
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68.9 


56.4 
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66.3 


68.5 


70.2 


64.8 


63.5 




CO 
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CO 
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82.6 






Identity 
(%) 




31.6 


28.5 


38.8 




24.3 

l . 


36.9 


47.6 


35.0 


31.5 


41.2 




37.2 


34.0 








59.1 




Table 1 (continued) 


Homologous gene 




Escherichia coli K12 bcr 


Vibrio cholerae JS1569 nptA 


Pseudomonas aureofaciens 30- 
84 phzC 




Streptomyces coelicolor A3(2) 
SCE8.16C 


Bacillus licheniformis ATCC 
9945A bcrA 


Mycobacterium tuberculosis 
H37Rv Rv0413 


Mycobacterium tuberculosis 
H37RvRv0412c 


Bacillus stearothermophilus 
NUB36 glnH 


Mycobacterium tuberculosis 
H37RvRv0410c pknG 




Bos taurus 


Escherichia coli K12 elaA 








Bacillus subtilis 168 purT 
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insertion element (IS3 related) 


insertion element (IS3 related) 


two-component system sensui 
histidine kinase 

transcriptional regulator I 


adenylosuccinate synthetase 


hypothetical protein 


hypothetical membrane protein 
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Table 1 (continued) 


Homologous gene 
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virulence factor 


virulence tactor 


virulence factor 


sodium/glutamate symport earner 
protein 


cadmium resistance protein 


cation efflux system protein 
(zinc/cadmium) 


monooxygenase or oxidoreductase 
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Table 1 (continued) 


Homologous gene 


Pseudomonas aeruginosa 
ORF24222 
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N-carbamoyl-D-amino acid 
amidohydrolase 


hypothetical protein 


novel two-componem leguicuuiy 
system 

aldehyde dehydrogenase 


heat shock transcription reguiaior 
heat shock protein dnaJ 


nucleotide exchange factor grpt | 
protein bound to the ATPase domain 
of the molecular chaperone DnaK 


heat shock protein dnaK 


hypothetical membrane protein 
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Table 1 (continued) 


Homologous gene 


Methanobactetium 
thermoautotrophicum Delta H 
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Function 










hypothetical membrane protein 


hypothetical protein 




sulfate adenylyltransferase, subunit 
1 


sulfate adenylyltransferase small 
chain 


phosphoadenosine phosphosulfate 
reductase 


ferredoxin-nitrate reductase 


ferredoxin/ferredoxin-NADP 
reductase 


huntingtin interador 
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Table 1 (continued) 


Homologous gene 










Bacillus subtilis ytnM 


Streptomyces coelicolor A3(2) 
SC7AB.10C 




Escherichia coli K12cysN 


Escherichia coli K12 cysD 


Bacillus subtilis cysH 
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hypothetical protein 
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ABC transporter 
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oxidoreductase 


transcription antiterminator or beta- 
glucoside positive regulatory protein 


CO 
tf> 
CO 

*o 

8 

3 

cn 

3 

V 

ja 
6 

JZ 

ex 

<A 
O 
JZ 
CL 

CO 


i 


o> 
tn 
to 

"O 

"35 
o 
u 
z> 

Ol 
X) 

6 

JC 

o. 
tfl 
o 
.c 

CL 
CO 


aspartate aminotransferase 




transposase (ISCg2) 


hypothetical membrane protein 




UDP-glucose dehydrogenase 


deoxycytidine triphosphate 
deaminase 




hypothetical protein 




beta-N-Acetylgtucosaminidase 


Matched 
length 
(a.a) 


o 

CN 


CM 

cn 


r- 
co 




CO 
CD 


s 




o 


o> 
cn 

CO 




CN 

*r 


CO 
CO 

T— 




cn 

CM 
CN 




o 

r- 


milarity 
(%) 


63.8 


69.3 


59.9 




78.8 


80.9 




100.0 


70.2 




72.2 


72.3 




59.4 




58.1 


CO 


































Identity 
(%) 


34.8 


28.1 


437 




43.9 


53.7 


100.0 


33.6 




m 
d 


43.6 




30.6 




28.5 


Homologous gene 


Streptomyces coelicolor A3(2) 
mmyQ 


Escherichia coli K12bglC 


Clostridium longisporum B6405 
abgA 




Clostridium longisporum B6405 
abgA 


Methylobaciilus flagellars aat 




Corynebacterium glutamicum 
ATCC 13032 tnp 


Streptomyces coelicolor A3(2) 
SCQ11.10C 




Sinorhizobium melilott rkpK 


Escherichia coli K12dcd 




Streptomyces coelicolor A3(2) 
SCC75A.16C 




Streptomyces thermoviolaceus 
nagA 


db Match 


gp:SC0276673J8 
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co 
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CO 
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O 
CM 
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o 
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CO 


1203 


1257 


co 

CO 


1317 
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CO 

m 


CO 
CM 




1689 


in 

CO 


Terminal 


3028163 
3028891 


3029033 
3028884 


3029782 
3029702 


3030535 


3030101 


3031979 


3032348 


3033863 


3035437 


3034105 


o 

^ 

m 

CO 

o 

CO 


| 3036845 


3037911 


3038942 


3038993 


3040748 


Initial 
(nt) 


3027561 
3028268 


302B878 
3029474 


3029504 
3030061 


3030155 


3030340 


3030723 
3032647 


— 1 

3032661 \ 


3034181 


3034287 


CO 

in 
p- 
co 

CO 

o 

CO 


3037411 


m 
r- 
CC 
r- 

c 

r- 


3038172 


1 3040681 


3041932 
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NO. 
(a.a.) 


6626 
6627 


6628 
6629 


6630 

DO J I 
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n 
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CC 


6633 


r 
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cc 
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> co 
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) cc 


6636 


6637 
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CO 
CO 
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cc 
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T3 

CC 
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CO 

1 
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NO. 
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3128 
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o 
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1 CO 
» [ CO 


r 
c 


' LT 
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1 j CO 
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CO 
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XT 

» CO 
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i 
i 














lase 












5 




Function 






hypothetical protein 






hypothetical membrane protein 


acyltransferase or macroiide 3-0 
acyltransferase 




hypothetical membrane protein 




hexosyltransferase 


methyl transferase 


phosphoenolpyruvate carboxykir 
(GTP) 


C4-dicarboxytate transporter 


hypothetical protein 


hypothetical protein 


mebrane transport protein 




15 




Matched 
length 
(aa) | 






1416 i 






CO 
CO 
CO 


CO 

o 

TT 




CD 
CM 

m 




CD 
CO 
CO 


m 

CM 


§ 


CM 
CO 
CO 


s 


o 

CM 


CO 

CO 

r- 








Similarity 
(%) 






































20 








TT 

CD 

TT 






47.1 


51.0 




54.8 




79.1 


73.3 


78.5 


52.7 


67.2 


85.0 


72.3 








Identity 
(%) 






29.6 






CO 

TT 

CM 


27.7 




31.2 




53.4 


58.6 


54.7 


24.4 


35.7 


69.1 


42.3 




25 
30 
35 


Table 1 (continued) 


Homologous gene 






Mycobacterium leprae 
Ml.CB1883.13c 






Mycobacterium leprae 
MLCB1 883.05c 


Streptomyces sp. acyA 




Mycobacterium leprae 
MLCB 1883.04.: 




Mycobacterium tuberculosis 
H37RV Rv0225 


Mycobacterium tuberculosis 
H37Rv Rv0224c 


Neocaliimastix frontalis pepck 


Pyrococcus abyssi Orsay 
PAB2393 


Escherichia coli K12 yggH 


Mycobacterium tuberculosis 
H37Rv Rv0207c 


Mycobacterium tuberculosis 
H37Rv Rv0206c mmpL3 




40 
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«' 

CO 

CO 

5 

o 

—J 

Ui 
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to 
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TT 
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CM 
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CO 
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. r- 
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CM 
TT 
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CO 
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r- 
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to 
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r- 
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1422 
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Terminal 
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3042437 


3042703 


3045788 
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3048048 


3046122 


j 3047197 


3049479 


3051190 


3049456 


j 3051964 


3052062 


3055769 


3056631 


3057317 


3059643 


3056096 
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Initial 
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Oi 
Oi 
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O 
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3042503 
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3043642 


3045796 


3047146 


3047189 


3047904 j 
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3050592 
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3053891 
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Function 


hypothetical membrane protein 


hypothetical membrane protein 


propionyl-CoA carboxylase complex 
B subunit 


polyketide synthase J 


acyl-CoA synthase 


hypothetical protein 




major secreted protein PS1 protein 
precursor 






antigen 8S-C 


hypothetical membrane protein 


nodulation protein 
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67.4 




99.5 






62.5 
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Identity 
(%) 


29.1 
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33.5 
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27.1 


51.2 
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CO 
CM 


Homologous gene 


Mycobacterium tuberculosis 
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Mycobacterium tuberculosis 
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pccB 
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Mycobacterium tuberculosis 
H37Rv Rv3802c 




Corynebacterium glutamicum 
(Brevibacterium flavum) ATCC 
17965 copi 
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family 






Function 


transcriptional regulator 


hypothetical protein 


glucan 1,4-alpha-glucosidase 


glycerophosphoryl diester 
phosphodiesterase 

gluconate permease 


pyruvate kinase 


L-lactate dehydrogenase 


hypothetical protein 


hydrolase or haloacid 
dehalogenase-like hydrolase 


efflux protein 


transcription activator or 
transcriptional regulator GntR 


phosphoesterasa 


|shikimate transport protein 


Matched 
length 
(aa) 


§ 

CO 
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CN 
CO 
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cn 


OJ 


CO 
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CN 

m 


CN 
fN 
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CN 


in 
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CN 
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milarity 
(%) 


57.1 


81.3 


55.3 


54.1 
71.9 


47.7 


99.7 


64.8 


58.5 


67.6 


57.0 


68.6 
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GO 
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29.0 
37.3 




99.7 


33.5 


32.1 


39.9 


27.6 


47.8 
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Homologous gene 


Streptomyces coelicolor A3(2) 
SC6G4.33 


Streptomyces lavendulae 
ORF372 | 


Saccharomyces cerevisiae 
S288C YIR019C stal 


Bacillus subtilis glpQ 
Bacillus subtilis gntP 


Corynebacterium glutamicum 
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Mycobacterium tuberculosis 
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cytchrome b6-F complex iron-sulfur 
subunit (Rieske iron-sutfur protein) 


NADH oxidase or NADH-dependent 
flavin oxidoreductase 


hypothetical membrane protein 
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elongation factor Tu 


preprotein translocase secY 


isocitrate dehydrogenase 
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acyl-CoA carboxylase or biol 
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5 
10 




Function 
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Table 1 (continued) 
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Function 
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Table 1 (continued) 


Homologous gene 
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Function 
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ammonium transporter 
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Example 2 

Determination of effective mutation site 

5 (1 ) Identification of mutation site based on the comparison of the gene nucleotide sequence of lysine-producing B-6 
strain with that of wild type strain ATCC 13032 

[0374] Corynebacterium glutamicum B-6, which is resistant to S-(2-aminoethyl)cysteine (AEC), rifampicin, strepto- 
mycin and 6-azauracil, is a lysine-producing mutant having been mutated and bred by subjecting the wild type ATCC 

10 1 3032 strain to multiple rounds of random mutagenesis with a mutagen, N-methyl-N* -nitro-N-nitrosoguanidine (NTG) 
and screening (Appl. Microbiol. Biotechnol., 32: 269-273 (1989)). First, the nucleotide sequences of genes derived 
from the B-6 strain and considered to relate to the lysine production were determined by a method similar to the above. 
The genes relating to the lysine production include lysE and lysG which are lysine-excreting genes; ddh, dapA, horn 
and lysC (encoding diaminopimelate dehydrogenase, dihydropicolinate synthase, homoserine dehydrogenase and 

is aspartokinase, respectively) which are lysine-biosynthetic genes; and pyc and zwf (encoding pyruvate carboxylase 
and glucose-6-phosphate dehydrogenase, respectively) which are glucose-metabolizing genes. The nucleotide se- 
quences of the genes derived from the production strain were compared with the corresponding nucleotide sequences 
of the ATCC 13032 strain genome represented by SEQ ID NOS:1 to 3501 and analyzed. As a result, mutation points 
were observed in many genes. For example, no mutation site was observed in lysE, lysG, ddh, dapA, and the like, 

20 whereas amino acid replacement mutations were found in horn, lysC, pyc, zwf, and the like. Among these mutation 
points, those which are considered to contribute to the production were extracted on the basis of known biochemical 
or genetic information. Among the mutation points thus extracted, a mutation, Val59Ala, in horn and a mutation, 
Pro458Ser, in pyc were evaluated whether or not the mutations were effective according to the following method. 

25 (2) Evaluation of mutation, Val59Ala, in horn and mutation, Pro458Ser, in pyc 

[0375] It is known that a mutation in horn inducing requirement or partial requirement for homoserine imparts lysine 
productivity to a wild type strain (Amino Acid Fermentation, ed. by Hiroshi Aida ef a/., Japan Scientific Societies Press). 
However, the relationship between the mutation, Val59Ala, in horn and lysine production is not known. It can be ex- 

30 amined whether or not the mutation, Val59Ala, in horn is an effective mutation by introducing the mutation to the wild 
type strain and examining the lysine productivity of the resulting strain. On the other hand, it can be examined whether 
or not the mutation, Pro458Ser, in pyc is effective by introducing this mutation into a lysine-producing strain which has 
a deregulated lysine-bioxynthetic pathway and is free from the pyc mutation, and comparing the lysine productivity of 
the resulting strain with the parent strain. As such a lysine-producing bacterium, No. 58 strain (FERM BP-7134) was 

35 selected (hereinafter referred to the "lysine-producing No. 58 strain" or the "No. 58 strain"). Based on the above, it was 
determined that the mutation, Val59Ala, in horn and the mutation, Pro458Ser, in pyc were introduced into the wild type 
strain of Corynebacterium glutamicum ATCC 1 3032 (hereinafter referred to as the "wild type ATCC 1 3032 strain" or 
the "ATCC 13032 strain") and the lysine-producing No. 58 strain, respectively, using the gene replacement method. A 
plasmid vector pCES30 for the gene replacement for the introduction was constructed by the following method. 

40 [0376] A plasmid vector pCE53 having a kanamycin-resistant gene and being capable of autonomously replicating 
in Coryneform bacteria (/Wo/. Gen. Genet, 196: 175-178 (1984)) and a plasmid pMOB3 (ATCC 77282) containing a 
levansucrase gene (sacB) of Bacillus subtilis (Molecular Microbiology, 6: 1 1 95-1 204 (1 992)) were each digested with 
Psfl. Then, after agarose gel electrophoresis, a pCE53 fragment and a 2.6 kb DNA fragment containing sacBwere 
each extracted and purified using GENECLEAN Kit (manufactured by BIO 101). The pCE53 fragment and the 2.6 kb 

45 DNA fragment were ligated using Ligation Kit ver. 2 (manufactured by Takara Shuzo), introduced into the ATCC 1 3032 
strain by the electroporation method (FEMS Microbiology Letters, 65: 299 (1 989)), and cultured on BYG agar medium 
(medium prepared by adding 10 g of glucose, 20 g of peptone (manufactured by Kyokuto Pharmaceutical), 5 g of yeast 
extract (manufactured by Difco), and 16 g of Bactoagar (manufactured by Difco) to 1 liter of water, and adjusting its 
pH to 7.2) containing 25 u.g/ml kanamycin at 30°C for 2 days to obtain a transformant acquiring kanamycin-resistance. 

50 As a result of digestion analysis with restriction enzymes, it was confirmed that a plasmid extracted from the resulting 
transformant by the alkali SDS method had a structure in which the 2.6 kb DNA fragment had been inserted into the 
Psil site of pCE53. This plasmid was named pCES30. 

[0377] Next, two genes having a mutation point, horn and pyc, were amplified by PCR, and inserted into pCES30 
according to the TA cloning method (Bio Experiment Illustrated vol. 3, published by Shujunsha). Specifically, pCES30 
55 was digested with BamH \ (manufactured by Takara Shuzo), subjected to an agarose gel electrophoresis, and extracted 
and purified using GENECLEAN Kit (manufactured by BIO 1 01 ). The both ends of the resulting pCES30 fragment were 
blunted with DNA Blunting Kit (manufactured by Takara Shuzo) according to the attached protocol. The blunt-ended 
pCES30 fragment was concentrated by extraction with phenol/chloroform and precipitation with ethanol, and allowed 
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to react in the presence of Taq polymerase (manufactured by Roche Diagnostics) and dTTP at 70 P C for 2 hours so 
that a nucleotide, thymine (T), was added to the 3'-end to prepare a T vector of pCES30. 

[0378] Separately, chromosomal DNA was prepared from the lysine-producing B-6 strain according to the method 
of Saito et al. {Blochem. Biophys. Acta, 72: 61 9 (1 963)). Using the chromosomal DNA as a template, PCR was carried 
out with Pfu turbo DNA polymelase (manufactured by Stratagene). In the mutated horn gene, the DNAs having the 
nucleotide sequences represented by SEQ ID NOS:7002 and 7003 were used as the primer set. In the mutated pyc 
gene, the DNAs having the nucleotide sequences represented by SEQ ID NOS:7004 and 7005 were used as the primer 
set. The resulting PCR product was subjected to agarose gel electrophoresis, and extracted and purified using GENE- 
GLEAN Kit (manufactured by BIO 1 01 ). Then, the PCR product was allowed to react in the presence of Taq polymerase 
(manufactured by Roche Diagnostics) and dATP at 72 ft C for 1 0 minutes so that a nucleotide, adenine (A) was added 
tothe3'-end. 

[0379] The above pCES30 T vector fragment and the mutated horn gene (1 .7 kb) or mutated pyc gene (3.6 kb) to 
which the nucleotide A had been added of the PCR product were concentrated by extraction with phenol/chloroform 
and precipitation with ethanol, and then ligated using Ligation Kit ver. 2. The ligation products were introduced into the 
ATCC 13032 strain according to the electroporation method, and cultured on BYG agar medium containing 25 u.g/ml 
kanamycin at 30°C for 2 days to obtain kanamycin-resistant transformants. Each of the resulting transformants was 
cultured overnight in BYG liquid medium containing 25 |xg/ml kanamycin, and a plasmid was extracted from the culturing 
solution medium according to the alkali SDS method. As a result of digestion analysis using restriction enzymes, it was 
confirmed that the plasmid had a structure in which the 1 .7 kb or 3.6 kb DNA fragment had been inserted into pCES30. 
The plasmids thus constructed were named respectively pChom59 and pCpyc458. 

[0380] The Introduction of the mutations to the wild type ATCC 13032 strain and the lysine-producing No. 58 strain 
according to the gene replacement method was carried out according to the following method. Specifically, pChom59 
and pCpyc458 were introduced to the ATCC 13032 strain and the No. 58 strain, respectively, and strains in which the 
plasmid is integrated into the chromosomal DNA by homologous recombination were selected using the method of 
Ikeda et al. (Microbiology 144: 1863 (1998)). Then, the stains in which the second homologous recombination was 
carried out were selected by a selection method, making use of the fact that the Bacillus subtilis levansucrase encoded 
by pCES30 produced a suicidal substance (J. of BacterioL, 174: 5462 (1992)). Among the selected strains, strains in 
which the wild type horn and pyc genes possessed by the ATCC 1 3032 strain and the No. 58 strain were replaced with 
the mutated horn and pyogenes, respectively, were isolated. The method is specifically explained below. 
[0381] One strain was selected from the transformants containing the plasmid, pChom59 or pCpyc458, and the 
selected strain was cultured in BYG medium containing 20 u.g/ml kanamycin, and pCG11 (Japanese Published Exam- 
ined Patent Application No. 91827/94) was introduced thereinto by the electroporation method. pCG11 is a plasmid 
vector having a spectinomycin- resistant gene and a replication origin which is the same as pCE53. After introduction 
of the pCGII, the strain was cultured on BYG agar medium containing 20 u.g/ml kanamycin and 1 00 u.g/ml spectinomycin 
at 30°C for 2 days to obtain both the kanamycin- and spectinomycin-resistant transformant. The chromosome of one 
strain of these transformants was examined by the Southern blotting hybridization according to the method reported 
by Ikeda et al. (Microbiology, 144: 1863 (1998)). As a result, It was confirmed that pChom59 or pCpyc458 had been 
integrated into the chromosome by the homologous recombination of the Cambell type. In such a strain, the wild type 
and mutated horn or pyc genes are present closely on the chromosome, and the second homologous recombination 
is liable to arise therebetween. 

[0382] Each of these transformants (having been recombined once) was spread on Sue agar medium (medium 
prepared by adding 1 00 g of sucrose, 7 g of meat extract, 1 0 g of peptone, 3 g of sodium chloride, 5 g of yeast extract 
(manufactured by Difco), and 18 g of Bactoagar (manufactured by Difco) to 1 liter of water, and adjusting its pH 7.2) 
and cultured at 30°C for a day. Then the colonies thus growing were selected in each case. Since a strain in which the 
sacB gene is present converts sucrose into a suicide substrate, it cannot grow in this medium (J. BacterioL, 174: 5462 
(1 992)). On the other hand, a strain in which the sacB gene was deleted due to the second homologous recombination 
between the wild type and the mutated horn or pyc genes positioned closely to each other forms no suicide substrate 
and, therefore, can grow in this medium. In the homologous recombination, either the wild type gene or the mutated 
gene is deleted together with the sacB gene. When the wild type is deleted together with the sacB gene, the gene 
replacement into the mutated type arises. 

[0383] Chromosomal DNA of each the thus obtained second recombinants was prepared by the above method of 
Saito et al. PCR was carried out using Pfu turbo DNA polymerase (manufactured by Stratagene) and the attached 
buffer. In the horn gene, DNAs having the nucleotide sequences represented by SEQ ID NOS:7002 and 7003 were 
used as the primer set. Also, in the pyc gene was used, DNAs having the nucleotide sequences represented by SEQ 
ID NOS:7004 and 7005 were used as the primer set. The nucleotide sequences of the PCR products were determined 
by the conventional method so that it was judged whether the horn or pyc gene of the second recombinant was a wild 
type or a mutant. As a result, the second recombinant which were called HD-1 and No. 58pyc were target strains having 
the mutated horn gene and pyc gene, respectively. 
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(3) Lysine production test of HD-1 and No. 58pyc strains 

[0384] The HD-1 strain (strain obtained by incorporating the mutation, Val59Ala, in the horn gene into the ATCC 
1 3032 strain) and the No. 58pyc strain (strain obtained by incorporating the mutation, Pro458Ser, in the 
the lysine-producing No. 58 strain) were subjected to a culture test in a 5 I jar fermenter by using the ATCC 13032 
strain and the lysine-producing No. 58 strain respectively as a control. Thus lysine production was examined. 
[0385] After culturing on BYG agar medium at 30*C for 24 hours, each strain was inoculated into 250 ml of a seed 
medium (medium prepared by adding 50 g of sucrose, 40 g of corn steep liquor, 8.3 g of ammonium sulfate. 1 g of 
urea, 2 g of potassium dihydrogenphosphate, 0.83 g of magnesium sulfate heptahydrate. 10 mg of iron sulfate hep- 
tahydrate, 1 mg of copper sulfate pentahydrate, 1 0 mg of zinc sulfate heptahydrate, 1 0 mg of p-alan.ne, 5 mg of nicotinic 
acid 1 .5 mg of thiamin hydrochloride, and 0.5 mg of biotin to 1 liter of water, and adjusting its pH to 7.2, hen to which 
30 g of calcium carbonate had been added) contained in a 2 1 buffle-attached Erlenmeyer flask and cultured therein 
at 30°C for 1 2 to 1 6 hours. A total amount of the seed culturing medium was inoculated into 1 .400 ml of a main culture 
medium (medium prepared by adding 60 g of glucose. 20 g of corn steep liquor, 25 g of ammonium chlonde 2 .5 g of 
potassium dihydrogenphosphate. 0.75 g of magnesium sulfate heptahydrate. 50 mg of Iron sulfate heptahydrate, 13 
mg of manganese sulfate pentahydrate, 50 mg of calcium chloride, 6.3 mg of copper sulfate pentahydrate. 1 .3 mg of 
zinc sulfate heptahydrate, 5 mg of nickel chloride hexahydrate. 1 .3 mg of cobalt chloride hexahydrate, 1 .3 mg Of am- 
monium molybdenate tetrahydrate, 14 mg of nicotinic acid, 23 mg of p-alanine, 7 mg of thiamin hydrochloride and 
0 42 mg of biotin to 1 liter of water) contained in a 5 1 jar fermenter and cultured therein at 32'C, 1 wm and 800 rpm 
while controlling the pH to 7.0 with aqueous ammonia. When glucose in the medium had been consumed, a glucose 
feeding solution (medium prepared by adding 400 g glucose and 45 g of ammonium chloride to 1 liter of water) was 
continuously added. The addition of feeding solution was carried out at a controlled speed so as to maintain the dis- 
solved oxygen concentration within a range of 0.5 to 3 ppm. After culturing for 29 hours, the culture was terminated 
The cells were separated from the culture medium by centrlfugation and then L-lysine hydrochloride in the supernatant 
was quantified by high performance liquid chromatography (HPLC). The results are shown in Table 2 below. 



Table 2 


Strain 


L-Lysine hydrochloride yield (g/l) 


ATCC 13032 


0 


HD-1 


8 


No. 58 


45 


No. 58pyc 


51 



[0386] As is apparent from the results shown in Table 2. the lysine productivity was improved by Introducing the 
mutation, Val59Ala. in the horn gene or the mutation, Pro458Ser, In the pyc gene. Accordingly, it was found that the 
mutations are both effective mutations relating to the production of lysine. Strain. AHP-3, in which the mutetion. 
Val59Ala, in the horn gene and the mutation, Pro458Ser. in the pyc gene have been introduced into the wild typeATCC 
13032 strain together with the mutation, Thr331lle in the lysC gene has been deposited on December 5, 2000 in 
National Institute of Bioscience and Human Technology. Agency of Industrial Science and Technology (Higashl 1-1-3, 
Tsukuba-shi, Ibaraki, Japan) as FERM BP-7382. 



Example 3 

Reconstruction of lysine-producing strain based on genome information 

[0387] The lysine-producing mutant B-6 strain (Appl. Microbiol. Biotechnol., 3Z 269-273 (1989)), which has been 
constructed by multiple round random mutagenesis with NTG and screening from the wild type ATCC 13032 strain, 
produces a remarkably large amount of lysine hydrochloride when cultured in a jar at 32'C using glucose as a carbon 
source However, since the fermentation period is long, the production rate is less than 2. 1 g/l/h. Breeding to reconstitute 
only effective mutations relating to the production of lysine among the estimated at least 300 mutations introduced into 
the B-6 strain in the wild type ATCC 1 3032 strain was performed. 

(1) Identification of mutation point and effective mutation by comparing the gene nucleotide sequence of the B-6 strain 
with that of the ATCC 13032 strain 

[0388] As described above, the nucleotide sequences of genes derived from the B-6 strain were compared with the 
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corresponding nucleotide sequences of the ATCC 13032 strain genome represented by SEQ ID NOS:1 to 3501 and 
analyzed to Identify many mutation points accumulated in the chromosome of the B-6 strain. Among these, a mutation 
Val591Ala, in horn, a mutation, Thr311lle, in tysC, a mutation, Pro458Ser, In pyc and a mutation, Ala213Thr, in zwf 
were specified as effective mutations relating to the production of lysine. Breeding to reconstitute the 4 mutations in 
he wild type strain and for constructing of an industrially important lysine-producing strain was carried out according 
to the method shown below. 

(2) Construction of plasmid for gene replacement having mutated gene 

[0389] The plasmid for gene replacement, P Chom59, having the mutated hom gene and the plasmid for gene re- 
placement, pCpyc458, having the mutated pyc gene were prepared in the above Example 2(2). Plasmids for gene 
replacement having the mutated tysC and zwfwere produced as described below. 

[0390] The lysC and zwf having mutation points were amplified by PGR, and inserted into a plasmid for gene re- 
Ent« e T ent ' pCES30 ' accordjn 9 ^ the TA cloning method described in Example 2(2) (Bio Experiment Illustrated Vol 3) 
[0391] Separately, chromosomal DNA was prepared from the lysine-producing B-6 strain according to the above 
method of Saito etal. Using the chromosomal DNA as a template, PCR was carried out with Pfu turbo DNA polymerase 
(manufactured by Stratagene). In the mutated lysC gene, the DNAs having the nucleotide sequences represented by 
SEQ ID NOS:7006 and 7007 were used as the primer set. In the mutated zwf gene, the DNAs having the nucleotide 
sequences represented by SEQ ID NOS:7008 and 7009 as the primer set. The resulting PCR product was subjected 
to agarose gel electrophoresis, and extracted and purified using GENEGLEAN Kit (manufactured by BIO 101) Then 
the PCR product was allowed to react in the presence of Taq DNA polymerase (manufactured by Roche Diagnostics) 
and dATP at 72°C for 1 0 minutes so that a nucleotide, adenine (A), was added to the 3'-end. 

[0392] The above pCES30 T vector fragment and the mutated lysC gene (1 .5 kb) or mutated zwf gene (2.3 kb) to 
which the nucleotide A had been added of the PCR product were concentrated by extraction with phenol/chloroform 
and precipitation with ethanol, and then ligated using Ligation Kit ver. 2. The ligation products were introduced into the * 
ATCC 13032 strain according to the electroporation method, and cultured on BYG agar medium containing 25 ug/ml 
kanamycin at 30*C for 2 days to obtain kanamycin-resistant transformants. Each of the resulting transformants was 
cultured overnight in BYG liquid medium containing 25 u.g/ml kanamycin, and a plasmid was extracted from the culturing 
solution medium according to the alkali SDS method. As a result of digestion analysis using restriction enzymes it was 
confirmed that the plasmid had a structure in which the 1 .5 kb or 2.3 kb DNA fragment had been inserted into pCES30. 
The plasmids thus constructed were named respectively pClysC311 and pCzwf213. 

(3) Introduction of mutation, Thr311 lie, in iysC into one point mutant HD-1 

[0393] Since the one mutation point mutant HD-1 in which the mutation, Val59Ala, in hom was introduced into the 
wild type ATCC 1 3032 strain had been obtained in Example 2(2), the mutation, Thr31 1 lie, in IysC was introduced into 
the HD-1 strain using P ClysC311 produced in the above (2) according to the gene replacement method described in 
Example 2(2). PCR was carried out using chromosomal DNA of the resulting strain and, as the primer set, DNAs having 
the nucleotide sequences represented by SEQ ID NOS:7006 and 7007 in the same manner as in Example 2(2). As a 
result of the fact that the nucleotide sequence of the PCR product was determined in the usual manner, it was confirmed 
that the strain which was named AH D-2 was a two point mutant having the mutated IysC gene in addition to the mutated 
hom gene. 

(4) Introduction of mutation, Pro458Ser, in pyc into two point mutant AHD-2 

[0394] The mutation, Pro458Ser, In pyc was introduced into the AHD-2 strain using the pCpyc458 produced in Ex- 
ample 2(2) by the gene replacement method described in Example 2(2). PCR was carried out using chromosomal 
of tne suiting strain and, as the primer set, DNAs having the nucleotide sequences represented by SEQ ID 
NOS:7004 and 7005 in the same manner as in Example 2(2). As a result of the fact that the nucleotide sequence of 
the PCR product was determined in the usual manner, it was confirmed that the strain which was named AHD-3 was 
a three point mutant having the mutated pyc gene in addition to the mutated hom gene and IysC gene. 

(5) Introduction of mutation, Ala213Thr, in zwf into three point mutant AHP-3 

[0395] The mutation, Ala213Thr, in zwf was introduced into the AHP-3 strain using the P Czwf458 produced in the 
above (2) by the gene replacement method described in Example 2(2). PCR was carried out using chromosomal DNA 
of the resulting strain and, as the primer set, DNAs having the nucleotide sequences represented by SEQ ID NOS: 
7008 and 7009 in the same manner as in Example 2(2). As a result of the fact that the nucleotide sequence of the PCR 
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product was determined in the usual manner, it was confirmed that the strain which was named ^ 
mutant having the mutated zwf gene in addition to the mutated horn gene, lysC gene and pyc gene. 



(6) Lysine production test on HD-1 , AHD-2, AHP-3 and APZ-4 strains 

[0396] The HD-1 , AHD-2, AHP-: 
fermenter in accordance with the n 
[0397] Table 3 shows the results 



[0396] The HD-1 , AHD-2, AHP-3 and APZ-4 strains obtained above were subjected to a culture test in a 5 I jar 
fermenter in accordance with the method of Example 2(3). 



Table 3 



Strain 


L-Lysine hydrochloride (g/l) 


Productivity (g/I/h) 


HD-1 


8 


0.3 


AHD-2 


73 


2.5 


AHP-3 


80 


2.8 


APZ-4 


86 


3.0 



r03981 Since the Ivsine-producing mutant B-6 strain which has been bred based on the random mutation and selection 
shows a p^vTof S^nan 2 1 g/I/h, the APZ-4 strain showing a high productivity of 3.0 g/Vh Is useful In industry. 

(7) Lysine fermentation by APZ-4 strain at high temperature 

103991 The APZ-4 strain, which had been reconstructed by introducing 4 effective mutations into the wild type strain, 
w^s subj^e<^to*e cutturingtest in a 5 1 jar fermenter in the same manner as in Example 2(3), except that the cultunng 
temperature was changed to 40°C. 
[0400] The results are shown in Table 4. 

Table 4 



Temperature (°C) 


L-Lysine hydrochloride (g/l) 


Productivity (g/l/h) 


32 
40 


86 
95 


3.0 
3.3 



r0401l As is apparent from the results shown in Table 4, the lysine hydrochloride titer and productivity .n culturing tf 
a Stan tenwaT of 40'C comparable to those at 32'C were obtained. In the mutated and b red lysi n ejsroducing B- 
6 s rain Snstmcted by repeating random mutation and selection, the growth and the lysine productive are owered 
LZoe™ exceeding 34-C so that lysine fermentation cannot be carried out. whereas lysine fermentation can 
be Sed out "stag the APZ-4 strain at a high temperature of 40-C so that the load of cooling is greatly reduced and 
fuslnrstrS useful. The lysine fermentation at high temperatures can be achieved by reflecting the high temperature 
adaptability inherently possessed by the wild type strain on the APZ-4 strain. „ mw : HM a 

[0402] As demonstrated in the reconstruction of the lysine-producing strain, the present invention provides novel 
LreedLg method effects for eliminating the problems in the conventional mutants and -^JMS 
tageous strains. This methodology which reconstitutes the production strain by reconstituting the effect ve mutation is 
an approach which is efficiently carried out using the nucleotide sequence information of the genome disclosed ,n the 
present invention, and its effectiveness was found for the first time in the present invention. 

Example 4 

Production of DNA microarray and use thereof 

T04031 A DNA microarray was produced based on the nucleotide sequence information of the ORF deduced from 
XI nucleotide sequences of lorynebactenum gMamicun, ATCC 1 3032 using software, and genes of which ex- 
pression is fluctuated depending on the carbon source during culturing were searched. 

(1) Production of DNA microarray 

[0404] chromosomal DNA ww preparedly 
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al. ( Biochem. Biophys. Acta, 72. 619 (1963)). Based on 24 genes having the nucleotide sequences represented by 
SEQ ID NOS:207. 3433, 281, 3435, 3439, 766, 3445, 1226, 1229, 3448, 3451, 3453, 3455, 1743, 3470, 2132, 3476 
3477, 3485, 3488, 3489, 3494, 3496, and 3497 from the ORFs shown In Table 1 deduced from the full genome nuclei 
otide sequence of Corynebacterium giutamicum ATCC 13032 using software and the nucleotide sequence of rabbit 
globln gene (GenBank Accession No. V00882) used as an Internal standard, oligo DNA primers for PCR amplification 
represented by SEQ ID NOS:7010 to 7059 targeting the nucleotide sequences of the genes were synthesized in a 
usual manner. 

[0405] As the oligo DNA primers used for the PCR, 

[0406] DNAs having the nucleotide sequence represented by SEQ ID NOS:7010 and 7011 were used forthe ampli- 
fication of the DNA having the nucleotide sequence represented by SEQ ID NO:207, 

[0407] DNAs having the nucleotide sequence represented by SEQ ID NOS:7012 and 7013 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3433, 

[0408] DNAs having the nucleotide sequence represented by SEQ ID NOS:7014 and 7015 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:281 , 

[0409] DNAs having the nucleotide sequence represented by SEQ ID NOS:7016 and 7017 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3435, 

[0410] DNAs having the nucleotide sequence represented by SEQ ID NOS:7018 and I 7019 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3439, 

[0411] DNAs having the nucleotide sequence represented by SEQ ID NOS:7020 and 7021 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:765, 

[0412] DNAs having the nucleotide sequence represented by SEQ ID NOS:7022 and 7023 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3445, 

[0413] DNAs having the nucleotide sequence represented by SEQ ID NOS:7024 and 7025 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:1226, 

[0414] DNAs having the nucleotide sequence represented by SEQ ID NOS:7026 and 7027 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO.1229, 

[0415] DNAs having the nucleotide sequence represented by SEQ ID NOS:7028 and 7029 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3448, 

[0416] DNAs having the nucleotide sequence represented by SEQ ID NOS:7030 and 7031 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3451 , 

[0417] DNAs having the nucleotide sequence represented by SEQ ID NOS:7032 and 7033 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3453, 

[0418] DNAs having the nucleotide sequence represented by SEQ ID NOS:7034 and 7035 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3455, 

[0419] DNAs having the nucleotide sequence represented by SEQ ID NOS:7036 and 7037 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:1743, 

[0420] DNAs having the nucleotide sequence represented by SEQ ID NOS:703B and 7039 were used for the am- - 
plificatlon of the DNA having the nucleotide sequence represented by SEQ ID NO:3470, 

[0421] DNAs having the nucleotide sequence represented by SEQ ID NOS:7040 and 7041 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:2132, 

[0422] DNAs having the nucleotide sequence represented by SEQ ID NOS:7042 and 7043 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3476, 

[0423] DNAs having the nucleotide sequence represented by SEQ ID NOS.7044 and 7045 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3477, 

[0424] DNAs having the nucleotide sequence represented by SEQ ID NOS:7046 and 7047 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3485, 

[0425] DNAs having the nucleotide sequence represented by SEQ ID NOS:7048 and 7049 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3488, 

[0426] DNAs having the nucleotide sequence represented by SEQ ID NOS:7050 and 7051 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3489, 

[0427] DNAs having the nucleotide sequence represented by SEQ ID NOS:7052 and 7053 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3494, 

[0428] DNAs having the nucleotide sequence represented by SEQ ID NOS:7054 and 7055 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3496, 

[0429] DNAs having the nucleotide sequence represented by SEQ ID NOS:7056 and 7057 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3497, and 

[0430] DNAs having the nucleotide sequence represented by SEQ ID NOS:7058 and 7059 were used for the am- 
plification of the DNA having the nucleotide sequence of the rabbit globin gene, 
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SYSTEM (manS^eS b^PPon Laser & Electronics Lab.) according to the manufacture's instruct-on*. 
(2) Synthesis of fluorescence labeled cDNA 

was further inoculated Into 30 ml of a minimum medium (medium prepared by add.ng 5 g of a k mm ° ni " m /. u J a ^- 5 ^ ° T 

o^and It 25- C foTl 0 Zte^nd then al 42'C for 1 1 0 minutes. The RNA extracted from the ce^g glucose as 
!u onri th« RNA extracted from the cells using ammonium acetate were labeled with Cy5-dUTP and 

C^d a ?TP S^^JJES^^ -acSon. the RNA was digested by adding 1.5 £ of 1 mo W 

ulactured by QIAGEN) according to the manufacture's instructions to give a volume of 1 0 ^l. 

(3) Hybridization 

rnil „. . lltraHuh /no iih fmanufactured bv Ambion) and the fluorescence-labeled cDNA solution (1 0 were mixed 

50°C, and the washing was carried out at 25°C. 

(4) Fluorescence analysis 

[0434) Th.« U or«oenc.«»«c<.achONAar w ^^ 

of the rabbit globin used as the internal standard and the Cy3/Cy5 ratios. 



Table 5 



SEQ ID NO 


Cy3 intensity 


Cy5 intensity 


Cy3/Cy5 


207 


5248 


3240 


1.62 



230 



EP 1 108 790 A2 



Table 5 (continued) 



SEQ ID NO 


Cy3 intensity 


Cy5 intensity 


Cy3/Cy5 


3433 


2239 


2694 


0.83 


2B1 


2370 


2595 


0.91 


3435 


2566 


2515 


1.02 


3439 


5597 


6944 


0.81 


765 


6134 


4943 


1.24 


3455 


1169 


1284 


0.91 


1226 


1301 


1493 


0.87 


1229 


1168 


1131 


1.03 


3448 


1187 


1594 


0.74 


3451 


2845 


3859 


0.74 


3453 


3498 


1705 


2.05 


3455 


1491 


1144 


1.30 


1743 


1972 


1841 


1.07 


3470 


4752 


3764 


1.26 


2132 


1173 


1085 


1.08 


3476 


1847 


1420 


1.30 


3477 


1284 


1164 


1.10 


3485 


4539 


8014 


0.57 


3488 


34289 


1398 


24.52 


3489 


43645 


1497 


29.16 


3494 


3199 


2503 


1.28 


3496 


3428 


2364 


1.45 


3497 


3848 


3358 


1.15 



[0436] The ORF function data estimated by using software were searched for SEQ ID NOS:3488 and 3489 showing 
remarkably strong Cy3 signals. As a result, it was found that SEQ ID NOS:3488 and 3489 are a maleate synthase 
gene and an isocitrate lyase gene, respectively. It is known that these genes are transcriptionally induced by acetic 
acid in Corynebacterium glutamicum {Archives of Microbiology, 16&. 262-269 (1997)). 

[0437] As described above, a gene of which expression is fluctuates could be discovered by synthesizing appropriate 
oligo DNA primers based on the ORF nucleotide sequence information deduced from the full genomic nucleotide 
sequence information of Corynebacterium glutamicum ATCC 1 3032 using software, amplifying the nucleotide sequenc- 
es of the gene using the genome DNA of Corynebacterium glutamicum as a template in the PCR reaction, and thus 
producing and using a DNA microarray. 

[0438] This Example shows that the expression amount can be analyzed using a DNA microarray in the 24 genes. 
On the other hand, the present DNA microarray techniques make it possible to prepare DNA microarrays having thereon 
several thousand gene probes at once. Accordingly, it is also possible to prepare DNA microarrays having thereon all 
of the ORF gene probes deduced from the full genomic nucleotide sequence of Corynebacterium glutamicum ATCC 
1 3032 determined by the present invention, and analyze the expression profile at the total gene level of Corynebac- 
terium glutamicum using these arrays. 

Example 5 

Homology search using Corynebacterium glutamicum genome sequence 
(1) Search of adenosine deaminase 

[0439] The amino acid sequence (ADD_ECOLI) of Escherichia coll adenosine deaminase was obtained from Swiss- 
prot Database as the amino acid sequence of the protein of which function had been confirmed as adenosine deaminase 
(EC3.5.4.4). By using the full length of this amino acid sequence as a query, a homology search was carried out on a 
nucleotide sequence database of the genome sequence of Corynebacterium glutamicum or a database of the amino 
acids in the ORF region deduced from the genome sequence using FASTA program (Proc. Natl. Acad. Sci. ISA, 85: 
2444-2448 (1988)). A case where E-value was le' 10 or less was judged as being significantly homologous. As a result, 
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no sequence significantly homologous with the Escherichia coli adenosine deaminase was found in the nucleotide 
sequence database of the genome sequence of Corynebacterium glutamicum or the database of the amino acid se- 
quences in the ORF region deduced from the genome sequence. Based on these results, it is assumed that Coryne- 
bacterium glutamicum contains no ORF having adenosine deaminase activity and thus has no activity of converting 
adenosine into inosine. 

(2) Search of glycine cleavage enzyme 

[0440] The sequences (GCSP_ECOLI, GCST_ECOLI and GCSH_ECOLI) of glycine decarboxylase, aminomethyl 
transferase and an aminomethyl group carrier each of which is a component of Escherichia coli glycine cleavage 
enzyme as the amino acid sequence of the protein, of which function had been confirmed as glycine cleavage enzyme 
(EC2. 1.2.10), were obtained from Swiss-prot Database. 

[0441] By using these full-length amino acid sequences as a query, a homology search was carried out on a nucleotide 
sequence database of the genome sequence of Corynebacterium glutamicum or a database of the ORF amino acid 
sequences deduced from the genome sequence using FASTA program. A case where E-value was le -10 or less was 
judged as being significantly homologous. As a result, no sequence significantly homologous with the glycine decar- 
boxylase, the aminomethyl transferase or the aminomethyl group carrier each of which is a component of Escherichia 
coli glycine cleavage enzyme, was found in the nucleotide sequence database of the genome sequence of Coryne- 
bacterium glutamicum or the database of the ORF amino acid sequences estimated from the genome sequence. Based 
on these results, it is assumed that Corynebacterium glutamicum contains no ORF having the activity of glycine de- 
carboxylase, aminomethyl transferase or the aminomethyl group carrier and thus has no activity of the glycine cleavage 
enzyme. 

(3) Search of IMP dehydrogenase 

[0442] The amino acid sequence (IMDH ECOLI) of Escherichia coli IMP dehydrogenase as the amino acid sequence 
of the protein, of which function had been confirmed as IMP dehydrogenase (EC1 .1 .1 .205), was obtained from Swiss- 
prot Database. By using the full length of this amino acid sequence as a query, a homology search was carried out on 
a nucleotide sequence database of the genome sequence of Corynebacterium glutamicum or a database of the ORF 
amino acid sequences predicted from the genome sequence using FASTA program. A case where E-value was le* 10 
or less was judged as being significantly homologous. As a result, the amino acid sequences encoded by two ORFs, 
namely, an ORF positioned in the region of the nucleotide sequence No. 61 5336 to 61 6853 (or ORF having the nucle- 
otide sequence represented by SEQ ID NO:672) and another ORF positioned in the region of the nucleotide sequence 
No. 616973 to 618094 (or ORF having the nucleotide sequence represented by SEQ ID NO:674) were significantly 
homologous with the ORFs of Escherichia coli IMP dehydrogenase. By using the above-described predicted amino 
acid sequence as a query in order to examine the similarity of the amino acid sequences encoded by the ORFs with 
IMP dehydrogenases of other organisms in greater detail, a search was carried out on GenBank (http://www.ncbi.nlm. 
nih.gov/) nr-aa database (amino acid sequence database constructed on the basis of GenBankCDS translation prod- 
ucts, PDB database, Swiss-Prot database, PIR database, PRF database by eliminating duplicated registrations) using 
BLAST program. As a result, both of the two amino acid sequences showed significant homologies with IMP dehdy- 
rogenases of other organisms and clearly higher homologies with IM P dehdyrogenases than with amino acid sequences 
of other proteins, and thus, it was assumed that the two ORFs would function as IM P dehydrogenase. Based on these 
results, it was therefore assumed that Corynebacterium glutamicum has two ORFs having the IMP dehydrogenase 
activity. 

Example 6 

Proteome analysis of proteins derived from Corynebacterium glutamicum 

(1) Preparations of proteins derived from Corynebacterium glutamicum ATCC 13032, FERM BP-7134 and FERM BP- 
158 

[0443] Culturing tests of Corynebacterium glutamicum ATCC 1 3032 (wild type strain), Corynebacterium glutamicum 
FERM BP-7134 (lysine-p reducing strain) and Corynebacterium glutamicum (FERM BP-158, lysine-highly producing 
strain) were carried out in a 5 1 jar fermenter according to the method in Example 2(3). The results are shown in Table 6. 
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Table 6 



10 



15 



20 



25 



30 



35 



40 



50 



55 



Strain 


L-Lysine yield (g/l) 


ATCC 13032 


0 


FERM BP-7134 


45 


FERM BP-158 


60 



[0444] After culturing, cells of each strain were recovered by centrlf ugation. These cells were washed with Tris-HCI 
buffer (1 0 mmol/l Tris-HCI, pH 6.5, 1 .6 mg/ml protease Inhibitor (COM PLETE; manufactured by Boehringer Mannheim)) 
three times to give washed cells which could be stored under freezing at -80*C. The freeze-stored cells were thawed 
before use, and used as washed cells. 

[0445] The washed cells described above were suspended in a disruption buffer (1 0 mmol/l Tris-HCI, pH 7.4 5 mmol/ 
I magnesium chloride, 50 mg/l RNase, 1.6 mg/ml protease inhibitor (COMPLETE: manufactured by Boehringer Man- 
nheim)), and disrupted with a disrupter (manufactured by Brown) undercooling. To the resulting disruption solution, 
DNase was added to give a concentration of 50 mg/l, and allowed to stand on ice for 10 minutes. The solution was 
centrifuged (5,000 x g, 15 minutes, 4°C) to remove the undisrupted cells as the precipitate, and the supernatant was 
recovered. 

[0446] To the supernatant, urea was added to give a concentration of 9 mol/l, and an equivalent amount of a lysis 
buffer (9.5 mol/l urea, 2% NP-40, 2% Ampholine, 5% mercaptoethanol, 1.6 mg/ml protease inhibitor (COMPLETE; 
manufactured by Boehringer Mannheim) was added thereto, followed by thoroughly stirring at room temperature for 
dissolving. 

[0447] After being dissolved, the solution was centrifuged at 12,000 x g for 15 minutes, and the supernatant was 
recovered. 

[0448] To the supernatant, ammonium sulfate was added to the extent of 80% saturation, followed by thoroughly 
stirring for dissolving. 

[0449] After being dissolved, the solution was centrifuged (16,000 x g, 20 minutes, 4°C), and the precipitate was 
recovered. This precipitate was dissolved in the lysis buffer again and used in the subsequent procedures as a protein 
sample. The protein concentration of this sample was determined by the method for quantifying protein of Bradford. 

(2) Separation of protein by two dimensional electrophoresis 

[0450] The first dimensional electrophoresis was carried out as described below by the isoelectric electrophoresis 
method. 

[0451] A molded dry IPG strip gel (pH 4-7, 13 cm, Immobiline DryStrips; manufactured by Amersham Pharmacia 
Biotech) was set in an electrophoretlc apparatus (Multiphor II or IPGphor; manufactured by Amersham Pharmacia 
Biotech) and a swelling solution (8 mol/l urea, 0.5% Triton X-100, 0.6% dithiothreitol, 0.5% Ampholine, pH 3-10) was 
packed therein, and the gel was allowed to stand for swelling 1 2 to 1 6 hours. 

[0452] The protein sample prepared above was dissolved In a sample solution (9 mol/l urea, 2% CHAPS, 1% dithi- 
othreitol, 2% Ampholine, pH 3-10), and then about 100 to 500 u.g (in terms of protein) portions thereof were taken and 
added to the swollen IPG strip gel. 
[0453] The electrophoresis was carried out in the 4 steps as defined below under controlling the temperature to 20°C: 

step 1 : 1 hour under a gradient mode of 0 to 500V; 
45 step 2: 1 hour under a gradient mode of 500 to 1 ,000 V; 

step 3: 4 hours under a gradient mode of 1 ,000 to 8,000 V; and 
step 4: 1 hour at a constant voltage of 8,000 V. 

[0454] After the isoelectric electrophoresis, the IPG strip gel was put off from the holder and soaked in an equilibration 
buffer A (50 mmol/l Tris-HCI, pH 6.8, 30% glycerol, 1% SDS, 0.25% dithiothreitol) for 15 minutes and another equili- 
bration buffer B (50 mmol/l Tris-HCI, pH 6.8, 6 mol/l urea, 30% glycerol, 1 % SDS, 0.45% iodo acetamide) for 1 5 minutes 
to sufficiently equilibrate the gel. 

[0455] After the equilibrium, the IPG strip gel was lightly rinsed in an SDS electrophoresis buffer (1 .4% glycine, 0. 1 % 
SDS, 0.3% Tris-HCI, pH 8.5), and the second dimensional electrophoresis depending on molecular weight was carried 
out as described below to separate the proteins. 

[0456] Specifically, the above IPG strip gel was closely placed on 14% polyacrylamide slub gel (14% polyacrylamide 
0.37% bisacrylamide, 37.5 mmol/l Tris-HCI, pH 8.8, 0.1% SDS, 0.1% TEMED, 0.1% ammonium persulfate) and sub- 
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jected to electrophoresis under a constant voltage of 30 mA at 20°C for 3 hours to separate the proteins. 

(3) Detection of protein spot 

[0457] Coomassle staining was performed by the method of Gorg et a.. (Bectrophoresis 

slub gel after the second dimensional electrophoresis. Specifically, the slub gel was stained under shaWng at 25 C_ to 
about 3 hours, the excessive coloration was removed with a decoloring solution, and the gel was thoroughly washed 

ro«S 8ti '^ e u' te are shown in Fig. 2. The proteins derived from the ATCC 1 3032 strain (Fig. 2A). FERM BP-71 34 
strain (Fig. 2B) and FERM BP-158 strain (Fig. 2C) could be separated and detected as spots. 

(4) In-gel digestion of detected protein spot 

r04591 The detected spots were each cut out from the gel and transferred into siliconized tube, and 400 jJ of 1 00 
mmo 1/1 ammonL bicamonate : acetonitri.e solution (1 :1 . v/v) was added thereto followed by ^XTo^reo 
fTe^ze-dried as such. To the dried gel. 1 0 nl of a lysylendopeptidase (LysC) solution (manufactured by WAKO. prepared 
Z U% ISionteining 50 mmoUl ammonium bicarbonate to give a ^T^^J^^fS^Z 
ael was allowed to stand for swelling at 0»C for 45 minutes, and then allowed to stand at 37 C fo 1 6 hours. After 
S ^rSsC Son. 20 jd of an extracting solution (a mixture of 60% acetonitnle and 5% formte acid .was 
Ldded followed oy ultrasonication at room temperature for 5 minutes to disrupt the gel. After the disruption the extract 
was recovered by centrifugation (12.000 rpm. 5 minutes, room temperature). This operation was repeated twtee to 
rov^tte^ 

To the concentrate. 20 ul of 0.1% trifluoroacetic acid was added, followed by thoroughly stirring, and the mixture was 
lubJelS to desatting Ssing ZipTlp (manufactured by Millipore). The protein absorbed on the earners of ZipTip was 
eluted with 5 ul of a-cyano-4-hydroxycinnamic acid for use as a sample solution for analysis. 

(5)Mass spectrometry and aminoacid sequence analysis of protein spot with matrix assisted laser desorption ionization 
time of flight mass spectrometer (MALDI-TOFMS) 

[04601 The sample solution for analysis was mixed in the equivalent amount with a solution of £ P«P« d « m ^ u ^ e 
mass caliration (300 nmol/l Angiotensin II. 300 nmol/l Neurotensin, 150 nmol/l ACTHchp 18-39 2.3 umol/1 bovine 
L* B chSS?and i J, of the obtained so.ution was spotted on a stainless probe and crystallized by spontaneously 

££] As measurement instruments. REFLEX MALDI-TOF mass spectrometer (manufactured by Bruker) and an 

by measuring 30 ties at an accelerated voltage of 19.0 kV and a detector voltage of 1 .50 kV under reflector mode 
renditions Mass calibration was carried out by the internal standard method. 

2? Th^D (po^soume decay) analysis was carried out using integration spectra obtained by successively 
alterlna the reflection voltage and the detector voltage at an accelerated voltage of 27.5 kV. 

r?464] T^emTsses and amino acid sequences of the peptide fragments derived from the protein spot after digestion 
were thus determined. 

(6) Identification of protein spot 

[0465] From the amino acid sequence information of the digested peptide fragments derived from the protein spot 

obtSed in the above (5). ORFs corresponding to the protein were searched on the ge •n^ -quence database of 

Corvnebacterium qlutamicum ATCC 1 3032 as constructed in Example 1 to identify the protein. 

[04^ T^e idTntmcation of the protein was carried out using MS-FIt program and MS-Tag program of intranet protein 

prospector. 

(a) Search and identification of gene encoding high-expression protein 

r04671 In the proteins derived from Corynebacterium glutamicum ATCC 13032 showing high expression amounts in 
Saining shown in Fig. 2A, the proteins corresponding to Spots-1 , 2 ,3,4and5were identified bytheabovemethod^ 
SLl Is a resul it was found that Spot-1 corresponded to enolase which was a protein having the ammo acid 
sequence of SEo S NO 4585; Spot-2 corresponded to phosphoglycelate kinase which was a protein havmg the ammo 
a?d seqTnce of SEQ ID NO:5254; Spot-3 corresponded to g^ceraldehyde-3-phosphate dehydrogenase wh,ch was 
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a protein having the amino acid sequence represented by SEQ ID NO:5255; Spot-4 corresponded to fructose bis- 
phosphate aldolase which was a protein having the amino acid sequence represented by SEQ ID NO:6543; and Spot- 
5 corresponded to triose phosphate isomerase which was a protein having the amino acid sequence represented bv 
SEQ ID NO:5252. " ' 

[0469] These genes, represented by SEQ ID NOS:1085, 1754, 1775, 3043 and 1752 encoding the proteins corre- 
sponding to Spots-1, 2, 3, 4 and 5, respectively, encoding the known proteins are important in the central metabolic 
pathway for maintaining the life of the microorganism. Particularly, it is suggested that the genes of Spots-2, 3 and 5 
form an operon and a high-expression promoter is encoded in the upstream thereof (J. of EacterioL, 174- 6067-6086 
(1992)). 

[0470] Also, the protein corresponding to Spot-9 in Fig. 2 was identified in the same manner as described above, 
and it was found that Spot-9 was an elongation factor Tu which was a protein having the amino acid sequence repre- , 
sented by SEQ ID No:6937, and that the protein was encoded by DNA having the nucleotide sequence represented 
by SEQ ID No:3437. 

[0471] Based on these results, the proteins having high expression level were identified by proteome analysis using 
the genome sequence database of Corynebacterium glutamicum constructed in Example 1 . Thus, the nucleotide se- 
quences of the genes encoding the proteins and the nucleotide sequences upstream thereof could be searched simul- 
taneously. Accordingly, It is shown that nucleotide sequences having a function as a high-expression promoter can be 
efficiently selected. 



(b) Search and identification of modified protein 

[0472] Among the proteins derived from Corynebacterium glutamicum FERM BP-7134 shown in Fig. 2B, Spots-6, 
7 and B were identified by the above method. As a result, these three spots all corresponded to catalase which was a 
protein having the amino acid sequence represented by SEQ ID NO:3785. 

[0473] Accordingly, all of Spots-6, 7 and 8 detected as spots differing in isoelectric mobility were all products derived 
from a catalase gene having the nucleotide sequence represented by SEQ ID No:285. Accordingly, it is shown that 
the catalase derived from Corynebacterium glutamicum FERM BP-7134 was modified after the translation. 
[0474] Based on these results, it is confirmed that various modified proteins can be efficiently searched by proteome 
analysis using the genome sequence database of Corynebacterium glutamicum constructed in Example 1 . 

(c) Search and identification of expressed protein effective in lysine production 

[0475] It was found out that in Fig. 2A (ATCC 13032: wild type strain), Fig. 2B (FERM BP-7134: lysine-producing 
strain) and Fig. 2C (FERM BP-158: lysine-highly producing strain), the catalase corresponding to Spot-8 and the elon- 
gation factor Tu corresponding to Spot-9 as identified above showed the higher expression level with an increase in 
the lysine productivity. 

[0476] Based on these results, It was found that hopeful mutated proteins can be efficiently searched and identified 
in breeding aiming at strengthening the productivity of a target product by the proteome analysis using the genome 
sequence database of Corynebacterium glutamicum constructed in Example 1 . 

[0477] Moreover, useful mutation points of useful mutants can be easily specified by searching the nucleotide se- 
quences (nucleotide sequences of promoter, ORF, or the like) relating to the identified proteins using the above data- 
base and using primers designed on the basis of the sequences. As a result of the fact that the mutation points are 
specified, industrially useful mutants which have the useful mutations or other useful mutations derived therefrom can 
be easily bred. 

[0478] While the Invention has been described in detail and with reference to specific embodiments thereof, it will 
be apparent to one of skill in the art that various changes and modifications can be made therein without departing 
from the spirit and scope thereof. All references cited herein are incorporated in their entirety. 



Claims 



1. A method for at least one of the following: 

(A) identifying a mutation point of a gene derived from a mutant of a coryneform bacterium, 

(B) measuring an expression amount of a gene derived from a coryneform bacterium, 

(C) analyzing an expression profile of a gene derived from a coryneform bacterium, 

(D) analyzing expression patterns of genes derived from a coryneform bacterium, or 

(E) identifying a gene homologous to a gene derived from a coryneform bacterium, 
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said method comprising 



s£ f™, -"Sisssa- • ■ - ,o " 200 ~'* u "" ,6 B * s,>s 01 

the first or second polynucleotides, polynucleotide derived from a co- 

labeled polynucleotide to be examined, under hybrid.zat.on conditions, 

(c) detecting any hybridization, and 

(d) analyzing the result of the hybridization. 

2 The method according to Calm 1 . wherein the coryneform bacterium te a microorganism be.onging to the genus 
S« the'genus BrevibzcteHum. or the genus Micrcbactenun,. 

, Themethodaccordingtociaim^hereinthemicro^ 

an organic acid, and analogues thereof. 

5. The method according to ciaim 1 , wherein the pofynuc.eot.de to be examined is derived from Esc^/a «* 

6. A polynucleotide array, comprising: 

tinuous bases of the first or second polynucleotides, and 
a solid support adhered thereto. 
7 A po*nuc,eotide comprising the nuc.eotide sequence represented by SEQ .0 NO:1 or a po*nuc.eot.de having a 
homology of at least 80% with the polynucleotide. 

comprising 10 to 200 continuous based. 

12. A recombinant DNA comprising the polynucleotide of any one of claims 8 to 11 . 

13 . A transformed comprising the poiynucleotide of any one of Cairns 8 to 11 or the recombinant DNA of claim 12. 

14. A method for producing a polypeptide, comprising: 
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culturing the transfomnant of claim 13 in a medium to produce and accumulate a polypeptide encoded by the 
polynucleotide of claim 8 or 9 In the medium, and 
recovering the polypeptide from the medium. 

s 15. A method for producing at least one of an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and 
analogues thereof, comprising: 

culturing the transformant of claim 13 in a medium to produce and accumulate at least one of an amino acid, 
a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof in the medium, and 
10 recovering the at least one of the amino acid, the nucleic acid, the vitamin, the saccharide, the organic acid, 

and analogues thereof from the medium. 

16. A polypeptide encoded by a polynucleotide comprising the nucleotide sequence selected from SEQ ID NOS:2 to 
3431. 



15 



17. A polypeptide comprising the amino acid sequence selected from SEQ ID NOS:3502 to 6931 . 



18. The polypeptide according to claim 16 or 17, wherein at least one amino acid is deleted, replaced, inserted or 
added, said polypeptides having an activity which is substantially the same as that of the polypeptide without said 

20 at least one amino acid deletion, replacement, insertion or addition. 

1 9. A polypeptide comprising an amino acid sequence having a homology of at least 60% with the amino acid sequence 
of the polypeptide of claim 1 6 or 1 7, and having an activity which is substantially the same as that of the polypeptide. 

25 20. An antibody which recognizes the polypeptide of any one of claims 16 to 1 9. 

21. A polypeptide array, comprising: 

at least one polypeptide or partial fragment polypeptide selected from the polypeptides of claims 1 6 to 1 9 and 
30 partial fragment polypeptides of the polypeptides, and 

a solid support adhered thereto. 

22. A polypeptide array, comprising: 

35 at least one antibody which recognizes a polypeptide or partial fragment polypeptide selected from the polypep- 

tides of claims 1 6 to 1 9 and partial fragment polypeptides of the polypeptides, and 
a solid support adhered thereto. 

23. A system based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
40 form bacterium, comprising the following: 

(i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID NOS:1 
to 3501 , and target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input Information; 

45 (Hi) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS: 

1 to 3501 with the target sequence or target structure motif information, recorded by the data storage device 
for screening and analyzing nucleotide sequence information which Is coincident with or analogous to the 
target sequence or target structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

50 

24. A method based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
form bacterium, comprising the following: 

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501, target se- 
55 quence information or target structure motif information into a user input device; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501 with 
the target sequence or target structure motif information; and 
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(iv) screening and analyzing nucleotide sequence Information which is coincident with or analogous to the 
target sequence or target structure motif information. 

25. A system based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
form bacterium, comprising the following: 

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 and target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input information; 

iin a comoaratorthat compares the at least one amino acid sequence mformation selected from SEQ ID NOS. 
3502 to 700? w th ?ne tTrget sequence or target structure motif information, recorded by the data storage 
devfce for screelg and analyzing amino acid sequence information which is coincident with or analogous to 
the target sequence or target structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

26. A method based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
form bacterium, comprising the following: 

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 , and target 
sequence information or target structure motif information into a user input device; 

(ii) at least temporarily storing said information; Mn ,_.. to 70Q1 
(ill) comparing the at least one amino acid sequence information selected from SEQ ID NOS.3502 to 7001 
with the taraet sequence or target structure motif Information; and 

M £3g and q anatyzing amino acid sequence information which is coincident with or analogous to the 
target sequence or target structure motif information. 

27 A system based on a computer for determining a function of a polypeptide encoded by a polynucleotide having a 
target nucleotide sequence derived from a coryneform bacterium, comprising the following: 

(I) a user input device that inputs at least one nucleotide sequence Information selected from SEQ ID . NOS:2 
io 3501. function information of a polypeptide encoded by the nucleotide sequence, and target nucleotide 
sequence information; 

(in a data storage device for at least temporarily storing the input information; 

(Of) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS_ 
2 to 3501 with the target nucleotide sequence information for determining a function of a polypeptide encoded 
by a polynucleotide having the target nucleotide sequence which is coincident w.th or analogous to the poly- 
nucleotide having at least one nucleotide sequence selected from SEQ ID NOSS to 3501 ; and 
(iv) an output devices that shows a function obtained by the comparator. 

28 A method based on a computerfor determining a function of a polypeptide encoded by a polypeptide encoded by 
a Polynucleotide having a target nucleotide sequence derived from a coryneform bacterium, comprls.ng the fol- 
towing: 

(I) inputting at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501 , function , in- 
foZtlon of a polypeptide encoded by the nucleotide sequence, and target nucleotide sequence Information, 
(ii) at least temporarily storing said infoimation; Mnco tn "«oi with 

(Hi) comparing the at least one nucleotide sequence information selected from SEQ ID NOS.2 to 3501 with 

the taraet nucleotide sequence information; and 

(iv) determining a function of a polypeptide encoded by a polynucleotide having the target nucleot.de sequence 
with or analogous to the polynucleotide having at least one nucleotide sequence selected 
from SEQ ID NOS:2 to 3501 . 

29. A system based on a computer for determining a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 , function information based on the amino acid sequence, and target ammo acid sequence infer- 
mation; 
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(ii) a data storing device for at least temporarily storing the Input Information; 

(III) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 with the target amino acid sequence information for determining a function of a polypeptide 
having the target amino acid sequence which Is coincident with or analogous to the polypeptide having at least 
5 one amino acid sequence selected from SEQ ID NOS:3502 to 7001.; and 

(iv) an output device that shows a function obtained by the comparator. 

30. A method based on a computer for determining a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 

10 

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 , function 
information based on the amino acid sequence, and target amino acid sequence information; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 
15 with the target amino acid sequence information; and 

(iv) determining a function of a polypeptide having the target amino acid sequence which is coincident with or 
analogous to the polypeptide having at least one amino acid sequence selected from SEQ ID NOS:3502 to 
7001. 

20 31 . The system according to any one of claims 23, 25, 27 and 29, wherein a coryneform bacterium is a microorganism 
of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

32. The method according to any one of claims 24, 26, 28 and 30, wherein a coryneform bacterium Is a microorganism 
of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

25 

33. The system according to claim 31 , wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium giutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetoglutamicum, Corynebacterium callunae, Corynebacterium hercuiis, Corynebacterium liiium, Corynebacteri- 
um meiassecoia t Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

30 

34. The method according to claim 32, wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetogiutamicum, Corynebacterium caiiunae, Corynebacterium hercuiis, Corynebacterium /ilium, Corynebacteri- 
um melassecoia, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

35 

35. A recording medium or storage device which is readable by a computer in which at least one nucleotide sequence 
information selected from SEQ ID NOS:1 to 3501 or function information based on the nucleotide sequence is 
recorded, and is usable in the system of claim 23 or 27 or the method of claim 24 or 28. 

40 36. A recording medium or storage device which is readable by a computer in which at least one amino acid sequence 
information selected from SEQ ID NOS:3502 to 7001 or function information based on the amino acid sequence 
is recorded, and is usable in the system of claim 25 or 29 or the method of claim 26 or 30. 

37. The recording medium or storage device according to claim 35 or 36, which is a computer readable recording 
45 medium selected from the group consisting of a floppy disc, a hard disc, a magnetic tape, a random access memory 

(RAM), a read only memory (ROM), a magneto-optic disc (MO), CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM 
and DVD-RW. 

38. A polypeptide having a homoserine dehydrogenase activity, comprising an amino acid sequence in which the Val 
so residue at the 59th in the amino acid sequence of homoserine dehydrogenase derived from a coryneform bacterium 

is replaced with an amino acid residue other than a Val residue. 

39. A polypeptide comprising an amino acid sequence in which the Val residue at the 59th position in the amino acid 
sequence as represented by SEQ ID NO:6952 is replaced with an amino acid residue other than a Val residue. 

55 

40. The polypeptide according to claim 38 or 39, wherein the Val residue at the 59th position is replaced with an Ala 
residue. 
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41. A polypeptide having pyruvate carboxylase activity, comprising an amino acid sequence in which the Pro residue 
at the 458th position in the amino acid sequence of pyruvate carboxylase derived from a coryneform bacterium is 
replaced with an amino acid residue other than a Pro residue. 

5 42. A polypeptide comprising an amino acid sequence in which the Pro residue at the 458th position in the amino acid 
sequence represented by SEQ ID NO:4265 is replaced with an amino acid residue other than a Pro residue. 

43. The polypeptide according to claim 41 or 42, wherein the Pro residue at the 458th position is replaced with a Ser 
residue. 

10 

44. The polypeptide according to any one of claims 38 to 43, which is derived from Corynebactehum glutamicum. 

45. A DNA encoding the polypeptide of any one of claims 38 to 44. 
15 AS. A recombinant DNA comprising the DNA of claim 45. 

47. A transformant comprising the recombinant DNA of claim 46. 

48. A transformant comprising in its chromosome the DNA of claim 45. 

20 

49. The transformant according to claim 47 or 48, which is derived from a coryneform bacterium. 

50. The transformant according to claim 49, which is derived from Corynebacterium glutamicum. 
25 51 . A method for producing L-lysine, comprising: 

culturing the transformant of any one of claims 47 to 50 in a medium to produce and accumulate L-lysine in 
the medium, and 

recovering the L-lysine from the culture. 

30 

52. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ 
ID NOS:1 to 3431 , comprising the following: 

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived a coryneform bacte- 
35 rium which has been subjected to mutation breeding so as to produce at least one compound selected from 

an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a fermentation 
method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431 ; 

(ii) identifying a mutation point present in the production strain based on a result obtained by (I); 

(iii) introducing the mutation point into a coryneform bacterium which is free of the mutation point, or deleting 
40 the mutation point from a coryneform bacterium having the mutation point; and 

(iv) examining productivity by the fermentation method of the compound selected In (I) of the coryneform 
bacterium obtained in (iii). 

53. The method according to claim 52, wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
45 a signal transmission pathway. 

54. The method according to claim 52, wherein the mutation point is a mutation point relating to a useful mutation 
which improves or stabilizes the productivity. 

so 55. A method for breading a coryneform bacterium using the nucleotide sequence information represented by SEQ 
ID NOS:1 to 3431, comprising: 

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived a coryneform bacte- 
rium which has been subjected to mutation breeding so as to produce at least one compound selected from 

55 an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a fermentation 

method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431 ; 

(ii) identifying a mutation point present in the production strain based on a result obtain by (i); 

(iii) deleting a mutation point from a coryneform bacterium having the mutation point; and 
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(Iv) examining productivity by the fermentation method of the compound selected in (0 of the coryneform 
bacterium obtained in (Hi). 

56. The method according to claim 55, wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
a signal transmission pathway. 

57. The method according to claim 55, wherein the mutation point is a mutation point which decreases or destabilizes 
the productivity. 

58. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ 
ID NOS:2 to 3431, comprising the following: 

(i) identifying an isozyme relating to biosynthesis of at least one compound selected from an amino acid, a 
nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof, based on the nucleotide se- 
quence information represented by SEQ ID NOS:2 to 3431 ; 

(ii) classifying the isozyme identified In (i) into an isozyme having the same activity; 

(iii) mutating all genes encoding the isozyme having the same activity simultaneously; and 

(Iv) examining productivity by a fermentation method of the compound selected in (i) of the coryneform bac- 
terium which have been transformed with the gene obtained in (ill). 

59. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ 
ID NOS:2 to 3431 , comprising the following: 

(i) arranging a function information of an open reading frame (ORF) represented by SEQ ID NOS:2 to 3431; 

(ii) allowing the arranged ORF to correspond to an enzyme on a known biosynthesis or signal transmission 
pathway; 

(iii) explicating an unknown biosynthesis pathway or signal transmission pathway of a coryneform bacterium 
in combination with information relating known biosynthesis pathway or signal transmission pathway of a co- 
ryneform bacterium; 

(iv) comparing the pathway explicated in (iii) with a biosynthesis pathway of a target useful product; and 

(v) transgenetically varying a coryneform bacterium based on the nucleotide sequence information to either 
strengthen a pathway which is judged to be important In the biosynthesis of the target useful product In (iv) or 
weaken a pathway which is judged not to be important in the biosynthesis of the target useful product in (iv). 

60. A coryneform bacterium, bred by the method of any one of claims 52 to 59. 

61. The coryneform bacterium according to claim 60, which is a microorganism belonging to the genus Corynebacte- 
rium, the genus Brevibacterium t or the genus M/crobacterium. 

62. The coryneform bacterium according to claim 61 , wherein the microorganism belonging to the genus Corynebac- 
terium is selected from the group consisting of Corynebacterium gtutamlcum, Corynebacterium acetoaddophilum, 
Corynebacterium acetoglutamicum, Corynebacterium caiiunae, Corynebacterium herculis, corynebacterium tih 
ium, Corynebacterium melassecola, Corynebacterium thermoamino genes, and Corynebacterium ammonia 
genes. 

63. A method for producing at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, 
an organic acid and an analogue thereof, comprising: 

culturing a coryneform bacterium of any one of claims 60 to 62 in a medium to produce and accumulate at 
least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, 
and analogues thereof; 
recovering the compound from the culture. 

64. The method according to claim 63, wherein the compound is L-lysine. 

65. A method for identifying a protein relating to useful mutation based on proteome analysis, comprising the following: 

(i) preparing 
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a protein derived from a bacterium of a production strain of a coryneform bacterium which has been sub- 
iected to mutation breeding by a fermentation process so as to produce at least one compound selected 
from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof, and 
a protein derived from a bacterium of a parent strain of the production strain; 

(ii) separating the proteins prepared in (i) by two dimensional electrophoresis; 

(iii) detecting the separated proteins, and comparing an expression amount of the protein derived from the 
production strain with that derived from the parent strain; . a 
(rv) treating the protein showing different expression amounts as a result of the companson with a peptidase 
to extract peptide fragments; 

(v) analyzing amino acid sequences of the peptide fragments obtained in (rv); and 

(vi) comparing the amino acid sequences obtained in (v) with the amino acid sequence represented by SEQ 
ID NOS:3502 to 7001 to identifying the protein having the amino acid sequences. 

66. The method according to claim 65, wherein the coryneform bacterium is a microorganism belonging to the genus 
corynebacterium, the genus Brevibacterium, or the genus Microbacterium, 

67 The method according to claim 66, wherein the microorganism belonging to the genus Corynebacterium is selected 
" from the group consisting of Corynebacterium gtutamicum, Corynebacterium acetoacidophilum, Corynebactenum 
acetogluiamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebactenum htium, Corynebacteri- 
um melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

68. A biologically pure culture of Corynebacterium glutamicum AHP-3 (FERM BP-7382) . 



242 



EP1 108 790 A2 




243 



EP1 108 790 A2 




244 



EP1 108 790 A2 




245 



EP1 108 790 A2 



FIG. 4 
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